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Abstract. ||We introduce consumption games, a model for discrete interactive 
system with multiple resources that are consumed or reloaded independently. 
More precisely, a consumption game is a finite-state graph where each transition 
is labeled by a vector of resource updates, where every update is a non-positive 
number or to. The uj updates model the reloading of a given resource. Each vertex 
belongs either to player □ or player O, where the aim of player □ is to play so 
that the resources are never exhausted. We consider several natural algorithmic 
problems about consumption games, and show that although these problems are 
computationally hard in general, they are solvable in polynomial time for every 
fixed number of resource types (i.e., the dimension of the update vectors). 



1 Introduction 

In this paper we introduce consumption games, a model for discrete interactive systems 
with multiple resources that can be consumed and reloaded independently. We show that 
consumption games, despite their rich modelling power, still admit efficient algorithmic 
analysis for a "small" number of resource types. This property distinguishes consump- 
tion games from other related models, such as games over vector addition systems or 
multi-energy games (see below), that are notoriously intractable. 

Roughly speaking, a consumption game is a finite-state directed graph where each 
state belongs either to player □ (controller) or player O (environment). Every transition 
s — > t is labelled by a li-dimensional vector 5 such that each component 6(i), where 
1 < 2 < d, is a non-positive integer (encoded in binary) or u>. Intuitively, if 6(i) - -n, 
then the current load of the 2-th resource is decreased by n while performing s — > t, 
and if 6(i) - co, then the 2-th resource can be "reloaded" to an arbitrarily high value 
greater than or equal to the current load. A configuration of a consumption game is 
determined by the current control state and the current load of all resources, which is 
a £/-dimensional vector of positive integers. A play of a consumption game is initiated 
in some state and some initial load of resources. The aim of player □ is to play safely, 
i.e., select transitions in his states so that the vector of current resource loads stays 
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positive in every component (i.e., the resources are never exhausted). Player O aims at 
the opposite. 

The resources may correspond to fuel, electricity, money, or even more abstract 
entities such as time or patience. To get a better intuition behind consumption games 
and the abstract problems studied in this paper, let us discuss one particular example in 
greater detail. 

The public transport company of Brno city0 maintains the network of public trams, 
buses, troleybuses, and boats. Due to the frequent failures and breakdowns in electrical 
wiring, rails, railroad switches, and the transport vehicles themselves, the company has 
several emergency teams which travel from one accident to another according to the 
directives received from the central supervisory service. Recently, the company was 
considering the possibility of replacing their old diesel vans by new cars equipped with 
more ecological natural gas engines. The problem is that these cars have smaller range 
and can be tanked only at selected gas stations. So, it is not clear whether the cars are 
usable at all, i.e., whether they can always visit a gas station on time regardless where 
and when an accident happens, and what are the time delays caused by detours to gas 
stations. Now we indicate how to construct the associated consumption game model 
and how to rephrase the above questions in abstract terms. 

We start with a standard graph G representing the city road network, i.e., the nodes 
of G correspond to distinguished locations (such as crossings) and the edges correspond 
to the connecting roads. Then we identify the nodes corresponding to gas stations that 
sell natural gas, and to each edge (road) we assign two negative numbers corresponding 
to the expected time and fuel needed to pass the road. Every morning, a car leaves a 
central garage (where it is fully tanked) and returns to the same place in the evening. 
The maximal number of accidents serviced per day can be safely overestimated by 12. 
Our consumption game C has two resource types modelling the fuel and time in the 
expected way. The fuel is consumed by passing a transition (road), and can be reloaded 
by the outgoing transitions of gas stations. The time is also consumed by passing the 
roads, and the only node where it can be reloaded is the central garage, but only after 
completing the 12 jobs. In the states of C we remember the current job number (from 1 
to 12) and the current target node. At the beginning, and also after visiting the current 
target node, the next target node is selected by player O. Technically, the current target 
node belongs to player O, and there is a transition for every (potential) next target node. 
Performing such a transition does not consume the resources, but the information about 
the next target node is stored in the chosen state, job index is increased, and the control 
over the play is given back to player □ who models the driver. This goes on until the job 
index reaches 12. Then, player O makes no further choice, but it is possible to reload 
the time resource at the node corresponding to the central garage, and hence player □ 
aims at returning to this place as quickly as possible (without running out of gas). Note 
that C has about 12 • n 2 states, where n is the number of states of G. 

The question whether the new cars are usable at all can now be formalized as fol- 
lows: Is there is safe strategy for player □ in the initial configuration such that the fuel 
resource is never reloaded to a value which is higher than the tank capacity of the car? 
In the initial configuration, the fuel resource is initialized to 1 because it can be immedi- 
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ately reloaded in the central garage, and the time resource is initialized to a "sufficiently 
high value" which is efficiently computable due to the finite reload property formulated 
in Corollary|7] Similarly, the extra time delays caused by detours to gas stations can be 
estimated by computing the minimal initial credit for the time resource, i.e., the min- 
imal initial value sufficient for performing a safe strategy, and comparing this number 
with the minimal initial credit for the time resource in a simplified consumption game 
where the fuel is not consumed at all (this corresponds to an ideal "infinite tank capac- 
ity"). Similarly, one could also analyze the extra fuel costs, or model the consumption 
of the material needed to perform the repairs, and many other aspects. 

An important point of the above example is that the number of resources is relatively 
small, but the number of states is very large. This motivates the study of parametrized 
complexity of basic decision/optimization problems for consumption games, where the 
parameters are the following: 

• d, the number of resources (or dimension); 

• I, the maximal finite \S(i)\ such that 1 < i < d and 6 is a label of some transition. 

Main results. For every state s of a consumption game C, we consider the following 
sets of vectors (see Section|2]for precise definitions): 

• Safe(s) consists of all vectors a of positive integers such that player □ has a safe 
strategy in the configuration {s, a). That is, Safe(s) consists of all vectors describing 
a sufficient initial load of all resources needed to perform a safe strategy. 

• Cover(s) consists of all vectors a of positive integers such that player □ has a safe 
strategy <x in the configuration (s, a) such that for every strategy n for player <> and 
every configuration (t,/3) visited during the play determined by <x and n we have that 
P < a. Note that physical resources (such as fuel, water, electricity, etc.) are stored in 
devices with finite capacity (tanks, batteries, etc.), and hence it is important to know 
what capacities of these devices are sufficient for performing a safe strategy. These 
sufficient capacities correspond to the vectors of Cover(s). 

Clearly, both Safe(s) and Cover(s) are upwards closed with respect to component-wise 
ordering. Hence, these sets are fully determined by their finite sets of minimal elements. 
In this paper we aim at answering the very basic algorithmic problems about Safe(s) and 
Coverts), which are the following: 

(A) Emptiness. For a given state s, decide whether Safe(s) = (or Cover(s) = 0). 

(B) Membership. For a given state s and a vector a, decide whether a e Safe(s) 
(or a e Cover(s)). Further, decide whether a is a minimal vector of Safe(s) (or 
Cover(s)). 

(C) Compute the set of minimal vectors of Safe(s) (or Cover(s)). 

Note that these problems subsume the questions of our motivating example. We show 
that all of these problems are computationally hard, but solvable in polynomial time 
for every fixed choice of the parameters d and £ introduced above. Since the degree of 
the bounding polynomial increases with the size of the parameters, we do not provide 
fixed-parameter tractability results in the usual sense of parametrized complexity (as it 
is mentioned in Section|3] this would imply a solution to a long-standing open problem 
in algorithmic game theory). Still, these results clearly show that for "small" parameter 
values, the above problems are practically solvable even if the underlying graph of C is 
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very large. More precisely, we show the following for game graphs with n states and m 
transitions: 

• The emptiness problems for Safe(s) and Cover(s) are coNP-complete, and solvable 
in 0(d\ ■ n d ■ m) time. 

• The membership problems for Safe(s) and Cover(s) are PSPACE-hard and solvable 
in time \a\ ■ (d ■ £ ■ n) 0(d) and 0(A 2 ■ n 2 ), respectively, where \a\ is the encoding size 
of a and A = I7 d =l a(i). 

• The set of minimal elements of Safe(s) and Cover(s) is computable in time 
{d-C- n) 0(d> and {d ■ t ■ n) 0(ddl \ respectively. 

Then, in Section |H we show that the complexity of some of the above problems 
can be substantially improved for two natural subclasses of one-player and decreasing 
consumption games by employing special methods. A consumption game is one-player 
if all states are controlled by player □, and decreasing if every resource is either reloaded 
or decreased along every cycle in the graph of C. For example, the game constructed in 
our motivating example is decreasing, and we give a motivating example for one-player 
consumption games in Section@] In particular, we prove that 

• the emptiness problem for Safe(s) and Cover(s) is solvable in polynomial time both 
for one-player and decreasing consumption games; 

• the membership problem for Safe(s) is PSPACE-complete (resp. NP-complete) for 
decreasing consumption games (resp. one-player consumption games). 

• Furthermore, for both these subclasses we present algorithms to compute the mini- 
mal elements of Safe(s) by a reduction to minimum multi-distance reachability prob- 
lem, and solving the minimum multi-distance reachability problem on game graphs. 
Though the algorithms do not improve the worst case complexity over general con- 
sumption games, the algorithms are iterative and potentially terminate much earlier 
(we refer to Section l4~3l and Section l4~4l for details). 

Related work. Our model of consumption games is related but incomparable to en- 
ergy games studied in the literature. In energy games both positive and non-positive 
weights are allowed, but in contrast to consumption games there are no w-weights. 
Energy games with single resource type was introduced in [4|, and it was shown that 
the minimal initial credit problem (and also the membership problem) can be solved 
in exponential time. It follows from the results of |4| that the related problem of ex- 
istence of finite initial credit (which is same as the emptiness problem), which was 
also shown to be equivalent to two-player mean-payoff games (2), lie in NP n coNP. 
Games over extended vector addition systems with states (eVASS games), where the 
weights in transition labels are in {-1,0, l,a>], were introduced and studied in Q. It 
this paper, it was shown that the question whether player □ has a safe strategy in a 
given configuration is decidable, and the winning region of player □ is computable 
in cZ-EXPTIME, where d is the eVASS dimensions, and hence the provided solution 
is impractical even for very small d's. A closely related model of energy games with 
multiple resource types (or multi-energy games) were considered in [6]. The minimal 
initial credit problem (and also the membership problem) for multi-energy games can 
be reduced to the problem of games over eVASS with an exponential reduction to en- 
code the integer weights into weights {-1,0,1}. Thus the minimal initial credit problem 
can be solved in af-EXPTIME, and the membership problem is EXPSPACE-hard (the 
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hardness follows from the classical result of Lipton [ 1 1 1). The existence of finite ini- 
tial credit problem (i.e., the emptiness problem) is coNP-complete for multi-energy 
games |6 |. Thus the complexity of the membership and the minimal initial credit prob- 
lem for consumption games is much better (it is in EXPTIME and PSPACE-hard and 
can be solved in polynomial time for every fixed choice of the parameters) as com- 
pared to eVASS games or multi-energy games (EXPSPACE-hard and can be solved 
in &-EXPTIME). For eVASS games with fixed dimensions, the problem can be solved 
in polynomial time for d — 2 (see J5J), and it is open whether the complexity can be 
improved for other constants. Moreover, for the important subclasses of one-player and 
decreasing consumption games we show much better bounds (polynomial time algo- 
rithms for emptiness and optimal complexity bounds for membership in Safe(sJ). 

The paper is organized as follows. After presenting necessary definitions in Sec- 
tion |2 we present our solution to the three algorithmic problems (A)-(C) for general 
consumption games in Section [3] In Section [4] we concentrate on the two subclasses 
of decreasing and one-player consumption games and give optimized solutions to some 
of these problems. Finally, in Section [5] we give a short list of open problems which, 
in our opinion, address some of the fundamental properties of consumption games that 
deserve further attention. Due to the lack of space, the proofs were shifted to Appendix. 

2 Definitions 

In this paper, the set of all integers is denoted by Z. For a given operator m e {>, <, <, >}, 
we use Zmo to denote the set {i e Z | i x 0}, and Z^ () to denote the set Z«o U {a»}, where 
a) t Z is a special symbol representing an "infinite amount" with the usual conventions 
(in particular, c + u- u + c- oj and c < u> for every c e Z). For example, Z<o is the 
set of all negative integers, and Z" is the set Z<o U {a>}. We use Greek letters a,f3, ... to 
denote vectors over Z M( ) or Z^ () , and to denote the vector of zeros. The 2-th component 
of a given a is denoted by a(i). The standard component-wise ordering over vectors is 
denoted by <, and we also write a < ft to indicate that a(i) < f3{i) for every i. 

Let M be a finite or countably infinite alphabet. A word over M is a finite or infinite 
sequence of elements of M. The empty word is denoted by e, and the set of all finite 
words over M is denoted by M*. Sometimes we also use M + to denote the set M* \ {e}. 
The length of a given word w is denoted by len(w), where len(s) = and the length of 
an infinite word is oo. The individual letters in a word w are denoted by w(Q), w(l), . . ., 
and for every infinite word w and every i > we use w, to denote the infinite word 
w(i), w(i+l), 

A transition system is a pair T = (V, — > ), where V is a finite or countably infinite 
set of vertices and — » c V x V a transition relation such that for every v eV there is at 
least one outgoing transition (i.e., a transition of the form v — > u). A path in T is a finite 
or infinite word w over V such that w(i) — > w(/+l) for every < i < len(w). We call a 
finite path a history and infinite path a run. The sets of all finite paths and all runs in T 
are denoted by FPath(T) and Run(T), respectively. 

Definition 1. A (2-player) game is a triple G = (V, (V D , V<>)) where (V, h->) is a 
transition system and (V n , V<>) is a partition ofV. IfV^ = 0, then G is a 1-player game. 
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A game G is played by two players, □ and O, who select transitions in the vertices of 
V n and V<>, respectively. Let O e {□, O}. A strategy for player O is a function which 
to each wv e V*V Q assigns a state v' e V such that v t-» v'. The sets of all strategies 
for player □ and player <> are denoted by Eq and TIq (or just by X and 77 if G is 
understood), respectively. We say that a strategy r is memoryless if r(wv) depends just 
on the last state v. Strategies that are not necessarily memoryless are called history- 
dependent. Note that every initial vertex v and every pair of strategies (cr,n) e I x 77 
determine a unique infinite path in G initiated in v, which is called a play and denoted 
by Play^v). 

Definition!. Let d > I. A consumption game of dimension d is a tuple C = 
(S , E , (S □, S o), L) where S is a finite set of states, (S , E) is a transition system, (S n ,S<y) 
is a partition of S , and L is labelling which to every (s,t) € E assigns a vector 
6 — (<5(1), . . . , 5(d)) such that 6(i) e Z" for every 1 < i < d. If s e 5<>, we require 
that 6(i) + for all 1 < i < d. We write s —* t to indicate that (s, t) 6 E and L(s, t) = 6. 

We say that C is one-player if = 0, and decreasing if for every n > 1, every 
1 < i < d, and every path sq -^-> s t • • • s n such that s Q = s„, there is some j <n 
where 6j(i) + 0. 

Intuitively, if s-^t, then the system modeled by C can move from the state s to the 
state t so that its resources are consumed/reloaded according to 6. More precisely, if 
6(i) < 0, then the current load of resource i is decreased by |<5(i)l. and if S(i) = to, then 
the resource i can be reloaded to an arbitrarily high positive value larger than or equal to 
the current load. The aim of player □ is to play so that the resources are never exhausted, 
i.e., the vector of current loads stays positive in every component. The aim of player O 
is to achieve the opposite. 

The above intuition is formally captured by defining the associated infinite-state 
game Gc for C. The vertices of Gq are configurations of C, i.e., the elements of S x Z^ 
together with a special configuration F (which stands for "fail"). The transition relation 
h-> of Gc is determined as follows: 

• F i-> F. 

• For every configuration (s, a) and every transition s — » t of C such that a(i) + 6(i) > 
for all 1 < i < d, there is a transition (s, a) h-» (f, a+y) for every y e Z d such that 

- y(i) = 6(i) for every 1 < i < d where 5(i) + to; 

- y(i) > for every 1 < i < d where 6(i) = to. 

• If (s, a) is a configuration and s -4 1 a transition of C such that a(i) + 6(i) < for 
some 1 < i < d, then there is a transition (s, a) h-> F. 

• There are no other transitions. 

A strategy cr for player □ in G c is safe in a configuration (s, a) iff for every strategy 
7r for player O we have that Play^is, a) does not visit the configuration F. For every 
s e S , we use 

• Safe(s) to denote the set of all a e Z^ such that player □ has a safe strategy in (s, a); 

• Cover(s) to denote the set of all a e Z^ such that player □ has a safe strategy cr in 
(s, a) such that for every strategy n for player O and every configuration (f,/i) visited 
by Play^is, a) we have that (3 <a. 
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If a 6 Safe(s), we say that a is safe in s, and if a e Cover(s), we say that a is covers s. 
Obviously, Cover(s) c Safe(s), and both Safe(s) and Cover(i) are upwards closed w.r.t. 
component-wise ordering (i.e., if a e Safe(s) and a < a-', then a' e Safe(s)). This means 
that Safe(s) and Cover(s) are fully described by its finitely many minimal elements. 

Intuitively, Safe(s) consists of all vectors describing a sufficiently large initial 
amount of all resources needed to perform a safe strategy. Note that during a play, the 
resources can be reloaded to values that are larger than the initial one. Since physical 
resources are stored in "tanks" with finite capacity, we need to know what capacities of 
these tanks are sufficient for performing a safe strategy. These sufficient capacities are 
encoded by the vectors of Cover(s). 

3 Algorithms for general consumption games 

In this section we present a general solution for the three algorithmic problems (A)-(C) 
given in SectionQ] 

We start by a simple observation that connects the study of consumption games to a 
more mature theory of Streett games. A Streett game is a tuple S = (V, h->, (V n , V<>), 
where (V, (V a , V<>)) is a 2-player game with finitely many vertices, and J?l = 
{(Gi,/?i), . . . , (G m ,R m )}, where m > 1 and G,-,/?,- £ i-> for all 1 < i < m, is a Streett 
(or strong fairness) winning condition (for technical convenience, we consider G,, 
as subsets of edges rather than vertices). For an infinite path w in S, let inf(w) be the 
set of all edges that are executed infinitely often along w. We say that w satisfies J{ iff 
inf(vv) n Gj + implies inf(w) Pi Ri + for every 1 < i < m. A strategy <x e Z s is 
winning in v e V if for every n e 77s we have that Play^^v) satisfies Jl. The problem 
whether player □ has a winning strategy in a vertex v e V is coNP-complete [8|, and 
the problem can be solved in 0(m\ ■ |V|™ • | h-> |) time Ifl2l . 

For the rest of this section, we fix a consumption game C = (S,E,(S n ,S <>),L) of 
dimension d, and we use { to denote the maximal finite \S(J)\ such that 1 < i < d and 6 
is a label of some transition. A proof of the next lemma is given in Appendix lAl 

Lemma 3. Let Sc — (S^^SajS^),^) be a Streett game where !J{ — 
{(G l ,Ry),...,(G d ,R d )l d = {(s,t) £ E | L(s,t)(i) < 0}, and fl, = {(s,t) e E | 
L(s, t)(i) — lo] for every 1 < i < d. Then for every s e S the following assertions 
hold: 

1. IfSafe(s) + 0, then player □ has a winning strategy in s in the Streett game Sc- 

2. If player □ has a winning strategy in s in the Streett game Sc, then 
(d\ ■ \S\ ■{+ \,...,d\ ■ \S\ ■{ + 1) € Safe(s) n Cover(s). 

An immediate consequence of Lemma[3]is that Safe(s) — iff Cover(s) = 0. Our next 
lemma shows that the existence of a wining strategy in Streett games is polynomially 
reducible to the problem whether Safe(s) — in consumption games. The proof of 
Lemma|4]is similar to the proof of Lemma|3] 

Lemma 4. Let S = (V, h->, (V n , V<>), J?l) be a Streett game where J?l = 
{(Gj , R\), . . . , (Cm, Rm)}- Let Cs - (V, (V n , V<y), L) be a consumption game of di- 
mension m where L(u,v)(i) is either —I, a>, or 0, depending on whether (u,v) e G;, 
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(m, v) e Rj, or (u, v) £ G,- U Ri, respectively. Then for every v e V we have that player □ 
has a winning strategy in v (in S) iffSafe(v) + {in Cs)- 

A direct consequence of Lemma[3]and Lemma|4]is the following: 

Theorem 5. The emptiness problems for Safe(s) and Cover(s) are coNV-complete and 
solvable in 0(d\ ■ \S\ d ■ \E\) time. 

Also observe that if managed to prove that the emptiness problem for Safe(s) or 
Cover(s) is fixed-parameter tractable in d for consumption games where the maximal 
decrement £ is equal to one (i.e., if we proved that the problem is solvable in time 
Fid) ■ n° m where n is the size of the game and F a computable function), then due 
to Lemma [4] we would immediately obtain that the problem whether player □ has a 
winning strategy in a given Streett game is also fixed-parameter tractable. That is, we 
would obtain a solution to one of the longstanding open problems of algorithmic game 
theory. 

Now we show how to compute the set of minimal elements of Safe(s). A key obser- 
vation is the following lemma whose proof is non-trivial. 

Lemma 6. For every s e S and every minimal a e Safe(s) we have that a(i) < d ■ ( • \S \ 
for every 1 < i < d. 

Observe that Lemma|6]does not follow from Lemma[3](2.). Intuitively, Lemma[3](2.) 
only says that if all resources are loaded enough, then there is a safe strategy. However, 
we aim at proving a substantially stronger result saying that no resource needs to be 
reloaded to more than d-l- \S \ regardless how large is the current load of other resources. 

Intuitively, Lemma|6]is obtained by a somewhat tricky inductive argument where we 
first consider all resources as being "sufficiently large" and then bound the components 
one by one. Since a similar technique is also used to compute the minimal elements of 
Cover(s), we briefly introduce the main underlying notions and ideas. 

An abstract load vector n is an element of (Z" ) d . The precision of n is the number 
of components different from co. The standard componentwise ordering is extended also 
to abstract load vectors by stipulating that c < a> for every c eZ. Given an abstract load 
vector n and a vector a e (Z>o) d , we say that a matches n if a(j) - for all 1 < j < d 
such that n(j) + cj. Finally, we say that n is compatible with Safe(s) (or Cover(s)) if 
there is some a e Safe(s) (or a € Cover(s)) that matches n. 

The proof of Lemma|6]is obtained by showing that for every minimal abstract load 
vector jt with precision i compatible with Safe(s) we have that n(J) < i • € • \S | for every 
1 < j < d such that n{j) + u. Since the minimal elements of Safe(s) are exactly the 
minimal abstract vectors of precision d compatible with Safe(s), we obtain the desired 
result. The claim is proven by induction on ;. In the induction step, we pick a minimal 
abstract vector ji with precision i compatible with s, and choose a component j such that 
n( j) - a). Then we show that if we replace n(j) with some k whose value is bounded by 
(i + 1) ■ t • \S\, we yield a minimal compatible abstract vector with precision i + 1, The 
proof of this claim is the very core of the whole argument, and it involves several subtle 
observations about the structure of minimal abstract load vectors. The details are given 
in Appendix iBl 
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An immediate consequence of Lemma [6] is that if Safe(s) + 0, then 
(d ■ t ■ \S |, . . . , d ■ i ■ \S |) e Safe(s) D Cover(s), which improves the bound of 
Lemma[3](2.)- Another important consequence of Lemma|6]is the following: 

Corollary 7 (Finite reload property). If a e Safe(s) and /?(/) = min{a(0, d \S \}for 
every 1 < i < d, then (3 € Safe(s). 

Due to Corollary [7] for every minimal a e Safe(s) there is a safe strategy which never 
reloads any resource to more than d ■ ( ■ \S\. Another consequence of Corollary [7] is 
that one can reduce the problem of computing the minimal elements of Safe(s) to the 
problem of determining a winning set in a finite-state 2-player safety game with at most 
\S | • d d ■ ( d ■ \S \ d + 1 vertices, which is obtained from C by storing the vector of current 
resource loads explicitly in the states. Whenever we need to reload some resource, it 
can be safely reloaded to d ■ € ■ \S |, and we simulate this reload be the corresponding 
transition. Since the winning set in a safety game with n states and m edges can be 
computed in time linear in n + m 119111 , we obtain the following: 

Corollary 8. The sets of all minimal elements of all Safe(s) are computable in time 
(d-( -\S\) 0{d \ 

The complexity bounds for the algorithmic problems (B) and (C) for Safe(s) are given 
in our next theorem. The proofs of the presented lower bounds are given in AppendixICl 

Theorem 9. Let a e Z d and s e S. 

• The problem whether a € Safe(s) is PSPACE-hard and solvable in time 
\a\ ■ (d ■ I • \S \) 0( - d \ where \a\ is the encoding size of a. 

• The problem whether a is a minimal vector ofSafe(s) is PSPACE-hard and solvable 
in time \a\ ■ (d ■ ( ■ \S \)°^ d \ where \a\ is the encoding size of a. 

• The set of all minimal vectors ofSafe(s) is computable in time (d ■ I • \S \) ^ 

Now we provide analogous results for Cover(s). Note that deciding the membership 
to Cover(s) is trivially reducible to the problem of computing the winning region in a 
finite-sate game obtained from C by constraining the vectors of current resource loads 
by a. Computing the minimal elements of Cover(s) is more problematic. One is tempted 
to conclude that all components of the minimal vectors for each Cover(s) are bounded 
by a "small" number, analogously to Lemma [6] In this case, we obtained only the fol- 
lowing bound, which is still polynomial for every fixed d and {, but grows double- 
exponentially in d. The question whether this bound can be lowered is left open, and 
seems to require a deeper insight into the structure of covering vectors. 

Lemma 10. For every s e S and every minimal a e Cover(s) we have that a(i) < 
(d-e-\S\) 2 " for every \ < i < d. 

The proof of Lemma[10]is given in Appendix IB"1 It is based on re-using and modifying 
some ideas introduced in [3 1 for general eVASS games. 

Theorem 11. Let a e Z d r . and s e S. 
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• The problem whether a 6 Cover(s) is PSPACE-hard and solvable in 0(A 2 ■ \S | 2 ) 
time, where A — IJf =l a(i). 

• The problem whether a is a minimal vector of Cover(s) is PSPACE-hard and solv- 
able in \a\ ■(d-(-\S \) 0{d - d - ) time, where \a\ is the encoding size of a. 

• The set of all minimal vectors ofCover(s) is computable in (d ■ I ■ \S \)°^ d ' d ^ time. 



4 Algorithms for one-player and decreasing consumption games 



In this section we present more efficient algorithms for the two subclasses of decreasing 
and one-player consumption games. Observe that these special games can still retain 
a rich modeling power. In particular, the decreasing subclass is quite natural as sys- 
tems that do not decrease some of the resources for a long time most probably stopped 
working completely (also recall that the game considered in Section Q] is decreasing). 
One-player consumption games are useful for modelling a large variety of scheduling 
problems, as it is illustrated in the following example. 

Consider the following a (bit idealized) problem of supplying shops with goods 
such as, e.g., bottles of drinking water. This problem may be described as follows: 
Imagine a map with c cities connected by roads, n of these cities contain shops to be 
supplied, k cities contain warehouses with huge amounts of the goods that should be 
distributed among the shops. The company distributing the goods owns d cars, each 
car has a bounded capacity. The goal is to distribute the goods from warehouses to all 
shops in as short time as possible. This situation can be modeled using a one-player 
consumption game as follows. States would be tuples of the form (ci, . . . , Cd,A) where 
each c, € {1, . . . ,c} corresponds to the city in which the z'-th car is currently located, 
A c {1, . . . , n\ lists the shops that have already been supplied (initially A — and the 
goal is to reach A = {1, . . . , n}). Loads of individual cars and the total time would be 
modelled by a vector of resources, {£{1), . . . , 1(d), ?), where each €(j) models the current 
load of the z'-th car and t models the amount of time which elapsed from the beginning 
(this resource is steadily decreased until A = {1, n}). Player □ chooses where each car 
should go next. Whenever the z'-th car visits a city with a warehouse, the corresponding 
resource {(i) may be reloaded. Whenever the z'-th car visits a city containing a shop, 
player □ may choose to supply the shop, i.e. decrease the resource 6(f) of the car by the 
amount demanded by the shop. Now the last component of a minimal safe configuration 
indicates how much time is needed to supply all shops. A cover configuration indicates 
not only how much time is needed but also how large cars are needed to supply all 
shops. This model can be further extended with an information about the fuel spent by 
the individual cars, etc. 

As in the previous section, we fix a consumption game C = {S,E,(S n ,S $),L) 
of dimension d, and we use I to denote the maximal finite \S(i)\ such that 1 < i < 
d and 6 is a label of some transition. We first establish the complexity of emptiness 
and membership problem, and then present an algorithm to compute the minimal safe 
configurations. 
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4.1 The emptiness and membership problems 

We first establish the complexity of the emptiness problem for decreasing games by a 
polynomial time reduction to generalized Biichi games. A generalized Biichi game is 
a tuple S = (V, m>, (Vp, V<>), B), where (V, h->, (V d , Vo)) is a 2-player game with finitely 
many vertices, and B = {Fi, ...,F m }, where m > 1 and Fj C i-> for all 1 < i < m. 
We say that infinite path w satisfies the generalized Biichi condition defined by B iff 
inf(vv) n Fi + for every 1 < i < m. A strategy cr e Eg is winning in v e V if for 
every n e Fig we have that the Play^^v) satisfies the generalized Biichi condition. The 
problem whether player □ has a winning strategy in state s can be decided in polynomial 
time, with an algorithm of complexity 0(\V\ ■ | i-> | • m) (see Q). 

From Lemma [3] we know that Safe(s) + if and only if the player □ has a winning 
strategy in s in Streett game Sc = (S,E,(S n ,S ( > ),^H) with Streett winning condition 
J[ = {(G u Ri), • ■ • , (G d ,R d )}, where G, = {(s, t) e E | L(s, t)(i) < 0}, and R t = {(s, t) e 
£ | L(s, t)(i) = a)} for every 1 < i < d. We claim that the following holds: 

Lemma 12. IfC is a decreasing game then player □ has a winning strategy in state s of 
the corresponding Streett game Sc if and only if he has winning strategy in generalized 
Biichi game Sc - (S,E, (5 n , 5<>), B) where B — [R\, . . . ,Rd}- 

Previous lemma immediately gives us that the emptiness of Safe(s) in decreasing 
games is achievable in time 0(\S | • \E\ ■ d). We now argue that the emptiness of Safe(s) 
for one-player games can also be achieved in polynomial time. Note that from Lemma[3] 
we have that Safe(s) + if and only if player □ has winning strategy in one-player 
Streett game Sc- The problem of deciding the existence of winning strategy in one- 
player Streett game is exactly the nonemptiness problem for Streett automata that can 
be solved in time 0((\S \-d + \E\) ■ min{|S |, d}) ifltl . 

Theorem 13. Given a consumption game C and a state s, the emptiness problems of 
whether Safe(s) — and Cover(s) — can be decided in time 0{\S\ ■ \E\ ■ d) if C is 
decreasing, and in time 0((\S \ ■ d + \E\) ■ min{|5 1, d}) if C is a one-player game. 

We now study the complexity of the membership problem for Safe(s). We prove 
two key lemmas and the lemmas bound the number of steps before all resources 
are reloaded. The key idea is to make player □ reload resources as soon as possi- 
ble. Formally, we say that a play Play a . jr (s,a) induced by a sequence of transitions 
so ^ • ' • reloads i-th resource in j-th step if 6j(i) = a>. We first present a lemma 
for decreasing games and then for one-player games. 

Lemma 14. Consider a decreasing consumption game C and a configuration (s, a) 
such that a e Safe(s). There is a safe strategy cr of player □ such that every Play^^s, a) 
reloads all resources in the first d ■ \S\ steps. 

Now let us consider one-player games. As player O has only one trivial strategy, n, 
we write only Play^is, a) instead of Play^^s, a). 

Lemma 15. Consider a one-player consumption game C and a configuration (s, a) 
such that a e Safe(s). There is a safe strategy cr of player □ such that for the Play^s, a) 
and every 1 < i < d we have that either the i-th resource is reloaded in the first d ■ \S\ 
steps, or it is never decreased from the (d-\S\ + l)-st step on. 
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As a consequence of Lemma [T4l Lemma [T31 and the hardness results presented in 
Proposition[22](in appendix) we obtain the following 

Theorem 16. The membership problem of whether a e Safe(s) is NP-completefor one- 
player consumption games and PSPACE-complete for decreasing consumption games. 

4.2 Minimum multi-distance reachability 

In this section we show that the problem of computing the set of minimal elements 
of Safe(s) for both one-player and decreasing consumption games can be reduced to 
the problem of computing the minimal multi-distances in consumption games, which is 
defined as follows. 

In the following, T> denotes a li-dimensional consumption game with transitions 
labeled by vectors over Z<o (i.e. there is no a> in any label) and r will be some state of 
D such that the only transition outgoing from r is r^>r (we call such a state a zero- 
sink). We say that vector a is a safe distance from some state s to r if there is a safe 
strategy <x in (s, a) such that for any strategy n for player O the infinite path Play^^s, a) 
visits a configuration of the form (r,/3). That is, a is a safe distance from s to r if player 
□ can enforce reaching r from s in such a way that the total decrease in resource values 
is less than a. 

We denote by Safe^is, r) the set of all safe distances from s to r in D, and by Ad(s, r) 
the set of all minimal elements of S(tfe (s, r). If Safe (s, r) = then we set A^{s, r) = 
(oo, . . . , oo), where the symbol oo is treated accordingly with the usual conventions (for 
any ceZwe have oo - c — oo, c < oo; we do not use the a> symbol to avoid confusions). 

We present a simple fixed-point iterative algorithm which computes the set of min- 
imal safe distances from s to r. Apart from the standard set operations, the algorithm 
uses the following operations on sets of vectors: for a given set M and a given vector a, 
the operation min-set(M) returns the set of minimal elements of M, and M -a returns 
the set {/? - a \ (3 e M\. Further, given a sequence of sets of vectors Mi, ... , M„, the 
operation cwm(Mi, . . . , M m ) returns the set {a\ V • • • V a m \ a\ e Mi, ... , a m e M m ], 
where each a\ V • ■ • V a m denotes a component- wise maximum of the vectors ai , . . . , a m . 

Technically, the algorithm iteratively solves the following optimality equations: for 
any state q with outgoing transitions q -^-> q\,. . .,q q m we have that 



The algorithm iteratively computes the fc-step approximations of A^(q, t), which 
are denoted by A k D (q, t). Intuitively, each set A k D (q, t) consists of all minimal safe dis- 
tances from q to t over all plays with at most k steps. The set A^(q, t) is initialized 
to {(oo, . . . , oo)} for q + t, and to {(1, . . . , 1)} for q — t. Each A k ^ l (q, t) is computed 
from A k (q, t) using the above optimality equations until a fixed point is reached. In 
Appendix|D]we show that this fixed point is the correct solution for the minimal multi- 
distance problem. 

Since the algorithm is based on standard methods, we postpone its presentation to 
Appendix |D]and state only the final result. To estimate the worst-case running time of 



A (q, t) = 



t 



min-set (A D (qi,t) - Si U ••■ U A D (q m ,t) - 8 m ) if q 6 (S n ) D 
min-set ( cvm(A D (qi,t) - Si,.. .,A (q m ,t) - 5 m ) ) if q e (S ) D 
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Min-dist(£),5,r), we denote by b the branching degree of £), i.e., the maximal number 
of transitions outgoing from any state of T>. 

Theorem 17. There is a procedure Min-dist(D, s, r) that correctly computes the set of 
minimal safe distances from s to r in time 0(\S d\ ■ b ■ ({ + l) 2d ■ a 2d+l ^j, where b is the 
branching degree of D and a is the length of a longest acyclic path in D. The resulting 
set A@(s, r) contains only vectors from {(oo, . . . , oo)}U(0, . . . , a-l\ d . I particular, A-£,(s, r) 
has sizeO(a d ■ (( + l) d ). 

In Appendix [D] we actually prove that the running time of the algorithm is 
0(\S d\- a-b ■ N 2 ), where N is the largest possible size of the sets A k D (q, t) that are used 
for computing the fixed point. This is an important observation, because the maximal 
size of these sets corresponds to the maximal number of minimal safe distances from 
any state q to r. If the consumption game T> models some real-world problem, then it 
can be expected that the number of minimal distances from states of D to r is actually 
much smaller then the worst case bound a • (i + 1) , because changes in resources are 
not entirely independent in these models (e.g., action that consumes a large amount of 
some resource may consume a large amount of some other resources as well). 

The proof also shows that the number of iterations needed to converge to the fixed 
point is a, but it can be much smaller in some instances. This observation becomes 
important in Section [4~4l where we apply this algorithm to a game where a is much 
smaller then \S |. Hence, algorithms based on minimal multi-distance reachability have 
a good chance to terminate early, as opposed to the algorithms presented in Section[3] 

4.3 Computing Safe(s) in one-player consumption games 

Now we present an algorithm that for a given li-dimensional one-player consumption 
game C and its state s computes the set of all minimal vectors in Safe(s). The algorithm 
computes the solution by solving several small instances of minimum multi-distance 
reachability problems. 

In this subsection we often work with several consumption games at once. To avoid 
confusion, we denote by Safe c (s) the set of all safe vectors in a state s of the game C, 
and by Iq the maximal finite \8{t)\ such that 5 is a label of some transition in C. We 
drop the subscript if the game is clear from the context. We assume that all states s with 
Safe(s) = were removed from the game. This can be done in polynomial time using 
the algorithm for emptiness (see Theorem[T3l. 

We denote by 11(d) the set of all permutations of the set {1, . . .,d). We view each 
element of 17(d) as a finite sequence ti\ . . .ltd, e.g., 77(2) = {12,21}. We use the stan- 
dard notation n for permutations: confusion with strategies of player O should not arise 
since S <> = 0. 

We say that a play Play a (s, a) matches a permutation n if for every 1 < i < j < d 
the following holds: If both 7r,-th and 7ry-th resources are reloaded during Play a (s, a), 
then the 7r,-th resource is reloaded for the first time before or at the same time as the 
7r/-th resource. A configuration (s, a) matches n if there is a strategy <x that is safe in 
(s, a) and Play a (s, a) matches n. We denote by Safe(s,n) the set of all vectors a such 
that (s, a) matches n. Note that Safe(s) = Safe(s, n). 
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As indicated by the above equality, computation of safe configurations in C reduces 
to the problem of computing, for every permutation 71, safe configurations that match 
7r. The latter problem, in turn, easily reduces to the problem of computing safe multi- 
distances in the one-player consumption games C(n) defined below. Intuitively, each 
game C{n) simulates the game C where the resources are forced to be reloaded in the 
order specified by n. So the states of each C(n) are pairs (s, k) where s corresponds to 
the current state of the original game and k indicates that the first k resources, in the 
permutation n, have already been reloaded. Now the crucial point is that if the first k 
resources have been reloaded and there is a safe strategy in some configuration of the 
form (s,/3), for some /3, which does not decrease any of the resources with the index 
greater than k, then we may safely conclude that the initial configuration is safe. So, in 
such a case we put a transition from (s, k) to a distinguished target state t (whether or 
not to put in such a transition can be decided in polynomial time due to Theorem [Oil. 
Other transitions of C(n) correspond to transitions of C except that they have to up- 
date the information about already reloaded resources, cannot skip any resource in the 
permutation (such transitions are removed), and the components indexed by it\, . . . , 77* 
are substituted with in transitions incoming to states of the form (q, k) (since already 
reloaded resources become unimportant as indicated by the above observation). 

A complete construction of C(n) is presented in Appendix [D] as a part of a formal 
proof of the following lemma: 

Lemma 18. For every permutation n there is a polynomial time constructible consump- 
tion game C(tt) of size 0(\S \ ■ d) such that for every vector a we have that a € Safe(s, n) 
in C iff a is a safe multi-distance from (s, 0) to t in C{n). 

By the previous lemma, every minimal element of Safe(s) is an element of 
^■C(n){{s, 0), t) for at least one permutation n. Our algorithm examines all permuta- 
tions 7r e 17(d), and for every permutation it constructs game C(n) and computes 
A.c(n)((s, 0), f) using the procedure Min-dist from Theorem[T7] The algorithm also stores 
the set of all minimal vectors that appear in some Ac(k)((s, 0), t). The pseudocode of the 
algorithm can be found in Appendix iDltogether with a proof of the following theorem: 

Theorem 19. There is a procedure Min-safe(C, s) that correctly computes the set of 
all minimal elements of Safe(s) in C. The worst-case complexity of the algorithm is 
0(d\-\S\ 2d+3 ■({+l) 2d -d 2d+2 ). 

In contrast with the algorithm of Section |4T1 that constructs a game which always 
has size 0(d d ■ \S \ d+x C d ), the algorithm of this section computes d\ "small" instances of 
the minimal multi-distance reachability problem. We can solve many of these instances 
in parallel. Moreover, in the previous section we have argued that each call of Min- 
dist(C(7r), (s, 0), r) may have much better running time than the worst-case upper bound. 
This suggests that algorithm Min-safe should be more useful for practical instances. 

4.4 Computing Safe(s) in decreasing consumption games 

In this section we describe an algorithm that for a given decreasing consumption game 
C and a state s of C computes the set of minimal elements of Safe(s). The main idea is 
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again to reduce this to task to the computation of minimal multi-distances in a certain 
consumption game. We again assume that states with Safe(s) = were removed from 
the game. 

The core of the reduction is the following observation: if C is decreasing, then a e 
Safe(s) iff player □ is able to ensure that the play satisfies these two conditions: all 
resources are reloaded somewhere along the play; and the 2-th resource is reloaded for 
the first time before it is decreased by at least a(j). Now if we augment the remaining 
states of C with an information about which resources have been reloaded at least once 
in previous steps, then the objective of player □ is actually to reach a state which tells 
us that all resources were reloaded at least once. 

So the algorithm constructs a game C by augmenting states of C with an information 
about which resources have been reloaded at least once, and by substituting updates 
of already reloaded resources (i.e., the corresponding components of the labels) with 
zeros. Note that the construction of C closely resembles the construction of games C{n) 
from the previous section. However, in two-player case we cannot fix an order in which 
resources are to be reloaded, because the optimal order depends on a strategy chosen 
by player O. Thus, we need to remember exactly which resources have been reloaded 
in the past (we only need to remember the set of resources that have been reloaded, but 
not the order in which they were reloaded). 

We leave the formal construction of game C to Appendix iDl together with a proof 
of the following theorem which only states the final reults. 

Theorem 20. LetC — (S ,E,(S a ,S $),L)be a decreasing consumption game. For every 
state s of C, the set of minimal elements ofSafe(s) is exactly the set A^{(s, 0), t) of safe 
multi-distances from s to t in C. The set of minimal elements ofSafe(s) is computable in 
time 0(2 d ■ d 2d+2 ■ \S \ 2d+3 ■ (i + l) 2d ). 

In the case of decreasing games, we cannot gain a solution by solving many "small" in- 
stances of the multi-distance reachability problem in parallel, which was an important 
advantage of the algorithm for one-player consumption games given in the previous sec- 
tion. However, the game C constructed by the algorithm presented in this section is still 
smaller than the safety game constructed by the "naive" algorithm of Corollary [8] The 
length of the longest acyclic path in C is bounded by \S \ ■ d, so the procedure Min-dist 
does not have to perform many iterations. Most importantly, the term (\S \ ■ (I + l)) d van- 
ished from the size of C. Eventhough this term appears in the worst-case complexity of 
Min-dist, in Section [4721 we have argued that it is likely that Min-dist performs better. 
Thus we still see algorithm of this section preferable to the algorithm of Section [3] 

5 Conclusions 

As it is witnessed by the results presented in previous sections, consumption games rep- 
resent a convenient trade-off between expressive power and computational tractability. 
The presented theory obviously needs further development before it is implemented in 
working software tools. Some of the issues are not yet fully understood, and there are 
also other well-motivated problems about consumption games which were not consid- 
ered in this paper. The list of important open problems includes the following: 
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• Improve the complexity of algorithms for Cover(s). This requires further insights 
into the structure of these sets. 

• Find efficient controller synthesis algorithms for objectives that combine safety with 
other linear-time properties. That is, decide whether player □ has a safe strategy such 
that a play satisfies a given LTL property no matter what player <> does. 

• Find algorithms for more complicated optimization problems, where the individual 
resources may have different priorities. For example, it may happen that fuel con- 
sumption or the price of batteries with large capacity are much more important than 
the time spent, and in that case we might want to optimize some weight function over 
the tuple of all resources. It may happen (and we have concrete examples) that some 
of these problems are actually solvable even more efficiently than the general ones 
where all resources are treated equally w.r.t. their importance. 

The above list is surely incomplete. The problem of optimal resource consumption is 
rather generic and appears in many different contexts, which may generate other inter- 
esting questions about consumption games. 
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Technical Appendix 



A Proofs about Street games 

LetC = (S, E, (So, S o), L) be a consumption game of dimension d. 

Lemma |3j Let Sc - (S,E,(S n ,So),&l) be a Streett game where 3{ = 
{(GuRi),...,iG d ,R d )}, d = [(s,t) e E | L(s,t)(i) < 0}, and Ri = {(s,t) e £ | 
L(s, t)(i) — <jj) for every 1 < i < d. Then for every s € S the following assertions 
hold: 

1. IfSafe(s) + 0, then player □ has a winning strategy in s in the Streett game Sc- 

2. If player □ has a winning strategy in s in the Streett game Sc, then 
(d\ ■ \S\ -l + \,...,d\ ■ \S\ ■{+ 1) e Safe(s) n Cover(s). 

Proof. First we observe that every strategy for the players in the Streett game is a strat- 
egy in the original consumption game and vice versa. 

1 . To prove the result of the first item, we show the contrapositive, i.e., we show that if 
player □ does not have a winning strategy in s in the Streett game, then Safe(s) = 0. 
If player □ does not have a winning strategy, then by the determinacy of Streett 
games and by existence of memoryless winning strategies for Rabin winning con- 
dition (complement of Streett winning condition) [8|, there exists a memoryless 
winning strategy n for player O. Fix the memoryless winning strategy n and con- 
sider the graph S^. obtained by fixing the strategy, and in S c only player □ makes 
choices. Consider any strongly connected component (sec) U reachable from s in 
S^, (we denote by U the set of edges of the sec). Then there must exist 1 < i < d 
such that UnGi ± and UC\Ri = 0; otherwise, against n, player □ can visit exactly 
the sec U infinitely often and ensure the Streett winning condition, contradicting n 
is a winning strategy. Consider the strategy ji in the original consumption game, 
and any arbitrary strategy <x for player □. Let w = Play^^s) be the play given cr 
and n from s, and let U' = inf(w). Then U' is a sec in S^., and hence there exists 
1 < i < d such that U' n G, ■ + and U' n = 0, i.e., in dimension i there exists at 
least one transition e\ where the weight is negative and there is no transition where 
the weight is positive. Since e\ is executed infinitely often and there is no increase 
in dimension i from some point on (as the set executed infinitely often is U' with no 
w-weight transition in dimesion /), it follows that irrespective of the initial vector a 
we have (s, a) i Safe(s). This completes the first item of the result. 

2. We now show the second item. If there is a winning strategy in the Streett game in 
s, then it follows from the results of 013171 that there exists a winning strategy cr of 
memory size at most dl. Fix the winning strategy cr of memory size dl, and consider 
the graph S^. obtained by fixing cr (it is the synchronous product obtained from Sc 
and the deterministic transducer of size dl describing cr). In S^ only player O makes 
choices, and the size of the vertex set of the graph is dl ■ \S\. Consider a cycle U 
reachable from (s, mo) in S^,, where mo is the initial memory state. The cycle U 
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must satisfy that for all 1 < i < d if U D G, ^ 0, then U C\Ri + 0; otherwise visiting 
exactly the cycle U infinitely often against cr player O can contradict that cr is a 
winning strategy. It follows that given the strategy cr, given any arbitrary counter- 
strategy n for player O, for all 1 < i < d, within every d\ ■ \S\ visits of G, there 
is at least one visit to /?,-. In other words, in the consumption game, the strategy cr 
ensures that for every dimension i, within d\-\S\ visits of negative weight transitions 
there is at least one w-weight transition. Since the maximum negative weight is at 
most I, it follows that starting with weight vector (d\ ■ \S\ ■ I + 1, . . . ,d\ ■ \S | ■ €+ 1), 
and whenever a w-weight transition is visited in dimension i reloading upto value 
d\-\S\-£+l, ensures that (d\ ■ \S \ ■ I + 1, . . .,d\ ■ \S\ ■ (+ 1) € Safe(s) n Cover(s). 

The desired result follows. □ 



B Proofs based on analyzing abstract load vectors 

In this section we present full proofs of Lemma [3] and Theorem [8] These proofs are 
based on similar underlying ideas, inspired by techniques originally presented in Q. 

For the rest of this section, we fix a consumption game C = (S,E, (S n ,S^), L) of 
dimension d, and we use t to denote the maximal finite \6(i)\ such that 1 < i < d and 5 
is a label of some transition. 

Let us recall the notions introduced in Section [3] An abstract load vector n is an 
element of (Z"„) fl '. For every 1 < i < d, we say that n(i) is precise if Ti{i) + a>. The 
precision of n is the number of its precise components. The type of n is the set T of all 
indexes i such that n(i) + a>. For every abstract load vector n, every 1 < j < d, and every 
k e Z" , we use n[jlk\ to denote the abstract load vector obtained from n by replacing 
its j-th component with k. Further, for every a e (Z>o) rf and every type T we define the 
corresponding abstract load vector n a of type T where n a {m) - a(m) for all m € T. We 
also use lj, where 1 < j < d, to denote the vector whose j-th component is equal to 1 
and the other components are equal to 0. 

The standard componentwise ordering is extended also to abstract load vectors by 
stipulating that c < to for every c e Z. Given an abstract load vector n and a vector 
a E (Z>o) flf , we say that a matches n if a(i) = 7r(i) for all 1 < i < d such that n(i) + u. 
Further, we say that n is compatible with Safe(s) (or Cover(s)) there is some a e Safe(s) 
(or a e Cover(s)) that matches n. The set of all abstract load vectors that are compatible 
with Safe(s) (or Cover(s)) is denoted by S-comp(s) (or C-comp(s)). 

Let 7T e S-comp(s), and a e Safe(s) a vector that matches n. Let K e Z>o. We say 
that a strategy cr which is safe in (s, a) stays above K with respect to n if for every 
strategy n of player O and every configuration (t,/3) visited along Play a) we have 
that f3(i) > K for every 1 < i < d such that 7i(i) = a>. The following lemma is immediate. 

Lemma 21. Let n e S-comp(s). Then for every K e Z>o there is a vector a e Safe(s) 
and a safe strategy cr for (s, a) such that a matches n and cr stays above K with respect 
to n. 

Now we have all the tools needed to prove Lemma[6] 
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Lemma|6j For every s e S and every minimal a E Safe(s) we have that a(i) < d ■ i ■ \S \ 
for every 1 < i < d. 

Proof. We show that for all < i < d and s e S , the precise components of all minimal 
n E S-comp(s) with precision i are bounded by i-£-\S\. This clearly suffices, because the 
minimal n E S-comp(s) with precision d are exactly the minimal elements of Safe(s). 

We proceed by induction on i. The case when i — is immediate, because the only 
abstract vector with precision is (to, . . . , to), and the claim holds trivially. Now assume 
that the claim holds for some i < d. Let us fix some minimal n £ S-comp(s) with 
precision i. Let T be the type of n, and let j £ T, 1 < j < d, be an arbitrary (but fixed) 
index. Further, let k be the least number such that n[j/k] e S-comp(s), and let K E Z>o 
be (some) number such that 

• for every / £ S and every n' e S-comp(t) of type T U { j} such that n' < n[j/k] we have 
that the vector obtained from n' by substituting every w-component with K belongs 
to Safe(t); 

• for every t € S and every minimal n' e S-comp(t) of precision i we have that the 
vector obtained from n' by substituting every w-component with K belongs to Safe(t). 

Such a K clearly exists because the total number of all such n' is finite. 

We claim that k < (i+\)-(-\S\. Assume the converse. We show that then nlj/k-l] e 
S-comp(s), which contradicts the minimality of k. 

Since n[j/k] e S-comp(s), due to Lemma |2T1 there exists a e Safe(s) and a strategy 
cr for player □ such that 

• a matches n[j/k], 

• cr is safe in (s, a) and stays above K with respect to T U {j}. 

It suffices to show that there is a safe strategy cr' for player □ in (s, a[j/k-l]). The 
strategy cr' keeps mimicking the moves of cr until one of the following conditions is 
satisfied: 

C 1 . The play visits a configuration (f , y) such that the corresponding abstract load vector 

iiy for type T U {]} satisfies n y € S-comp(t) and n y < n[j/k]. 
C2. The play visits a configuration (f, y) such that t e S n and there is a transition 

(f, y) (v,j8) such that 5{m) = a> for some m e T U [j] and the configuration 

(v,j8 + If) is safe. 

First we show that if player □ follows the strategy cr' from (s, a[j/k-l]), then the j-th 
resource cannot be decremented by more than \S\ - 1 times along a play unless CI 
or C2 happens. Assume the converse, i.e., player O has a strategy n such that the y'-th 
resource is decremented \S \ times along Play^, ^(s, a[j/k-l]) without encountering CI 
or C2. Then there must be two configurations of the form (f, y) and (f, g) such that for 
every m e T U {j} we have that 7r[j/k](m) > y(m) > g{m) and y(j) > g{j). We show 
that (t,y) satisfies CI. Clearly, n 7 < n[j/k]. It remains to show that 7T y e S-comp(t). 
This is achieved by identifying another abstract load vector n' of type T U {j} such that 
n' < n y and n' e S-comp(t). Since cr' mimics cr and cr is safe in (s, a[j/k]), we have 
that (t, g + lj) is safe. However, this means that the corresponding vector n g +i. for type 
T U { j) satisfies n e +ij £ S-comp(t), and we also have that < n y as required. 
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Obviously, if CI happens, then the configuration (f, y) is safe because cr stays 
above K and cr' mimics cr. Hence, player □ can simply switch to a safe strategy for 
(t, y). If C2 happens before CI, i.e., the play visits a configuration of (t, y) such that 
t 6 S a and there is a transition (t,y) -4 (v,y6) such that (5(to) = w for some to e T U {7} 
and the configuration (v,yS + lj) is safe, then /?(/) > z • £ • \S\, because the j-th resource 
could not be decremented by more than \S \ - 1 transitions so far. Now consider the cor- 
responding abstract load vector 71/3+1, f° r type (T U {j}) \ {to}. Since e S-comp(v) 
and its precision is i, we can apply induction hypothesis and conclude that there is an- 
other n' e S-comp(v) of the same type (T U {j}) \ {to} such that n' < np+\ and all precise 
components of n' are bounded by i ■ t ■ \S\. However, this means that n' < n y , where 
7r r is the abstract load vector for type (T U {j}) \ {to} corresponding to y. Hence, if cr' 
executes the transition (t, y) -4 (v,/3') where f3' is the same as j3 but the TO-th component 
is reloaded to at least K, then (v,/?') is a safe configuration, and cr' can switch to a safe 
strategy for (v,/3'). 

If CI and C2 are not encountered at all, then the j-th resource cannot be decre- 
mented by more that \S \ - 1 transitions at all, and hence cr' can mimic cr safely forever. 

So, we have shown that if n e S-comp(s) is a minimal abstract load vector of type 
T with precision i, then for every /' i T, 1 < i < d, we have that the least k such that 
e S-comp(s) is bounded by (z + 1) ■ I ■ \S\. However, not all minimal elements 
of S-comp(s) with precision i + 1 are necessarily of this form. So, let n' e S-comp(s) 
be some (unspecified) minimal abstract load vector of type T with precision i + 1, and 
suppose that ji'(m) > (i + 1) • (, ■ \S \ for some to € T . Let n" be the abstract load vector 
of type T \ {to} obtained from n' be replacing n'(m) with a>. Since the precision of n" 
is i and n" € S-comp(s), we can apply induction hypothesis and conclude that there is 
a minimal n e S-comp(s) of type T \ {to} such that 7r < n" . By the above presented 
argument, the vector n[m/k] of type T \ {to}, where k - (i + 1) • I ■ \S\, belongs to 
S-comp(s). Since 7i[m/k] < n' and the TO-component of n[m/k] is even strictly smaller, 
we have a contradiction with the minimality of 7r'. □ 

Now we show how to compute the set of minimal elements of Covert s) for every 
s e S . For every < i < d, let B, e Z>o be the least number such that for every s e S , 
every minimal abstract load vector n e C-comp(s) of precision i, and every a e X d >Q 
obtained from n by substituting every a> with Z?,- we have that a e Coverts). 

Lemma 1101 For every s e S and every minimal a € Cover(s) we have that a(i) < 
(d-i-\S\) d - for every I < i < d. 

Proof. Similarly as in the proof of Lemma|6] we show that for all < i < d and s e S , 
the precise components of all minimal n e C-comp(s) with precision i are bounded 
by (d ■ £ ■ \S\)''. By induction on i. The base case when i = follows directly from 
Lemma [6] Now assume that the claim holds for some i < d. Let us fix some minimal 
n G C-comp(s) with precision Let T be the type of n, and let j £T,\ < j < d, be an 
arbitrary (but fixed) index. Consider the least k such that n[j/k] e C-comp(s). We argue 
that k < (d ■ I ■ |S|) (!+1)! . To see this, realize that we can construct another consumption 
game C n of dimension d—i where the current values of all resources indexed by T are en- 
coded explicitly in the states of C„. By applying induction hypothesis, we can conclude 
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that the number of states of C n is bounded by \S \ ■ (d- £ ■ \S |) w! . By applying Lemma|6]to 
C n , we can conclude that a sufficient value for k is at most (d - i) ■ £ ■ \S\ ■ (d ■ £ ■ \S 
which is bounded by (d ■ €■ \S The rest of the argument is the same as in the proof 

of Lemma|6] □ 

Due to Lemma[10] all components of all minimal elements of Cover(s) are bounded by 
(d ■ I ■ \S \) dl . Hence, there are (d ■ I ■ \S |) possible candidates for the minimal elements 
of Cover(s), and we can (in principle) check them by computing a winning region in a 
safety game of size \S\-(d-£-\S \) dd] . Hence, the set all minimal elements of all Cover(s), 
s e S , is computable in time (d ■ £ ■ \S \) 0( - d ' dl) . Thus, we obtain the upper complexity 
bounds of TheoremfTTI 



C Proofs of lower bounds 

For any n e Z U {a>} we denote n the vector (n, . . . , n). 

Proposition 22. The problems whether a given vector a e Z> () is an element ofSafe(s), 
element of Cover(s), minimal element of Safe(s) or minimal element of Cover(s), in 
state s of given d-dimensional consumption game C are: 

• NP-hard for 1 -player consumption games. 

• PSPACE-hardfor 2-player consumption games. 

Moreover the problem whether a given vector a e Z^ () is a minimal element ofSafe(s) 
is DP -hard for 1 -player consumption games. Moreover, all of these lower bounds hold 
even if we restrict ourselves to games where components of all labels are nonzero. 

We will prove Proposition|22]in two stages. First, we will show the following: 

Lemma 23. The problems whether a given a € Z^ is a (minimal) element of 'Safe(s) 
( Cover(s)) is NP-hard for 1 -player consumption games and PSPACE-hardfor 2-player 
consumption games, even if we restrict ourselves to games where components of all 
labels are nonzero.. 

Proof. We will start by proving that deciding whether a is (minimal) safe vector in state 
s is PSPACE-hard for 2-player games, by reduction from QBF (we can use the same 
reduction for both problems). Once the proof is finished, it will be obvious how to adapt 
the reduction to prove NP-hardness for 1 -player case. 

Let ifr = Q\X\ . . . Q„x n (p be a quantified boolean formula, where Q t e {V, 3} for 
every 1 < i < n, and where ip is a quantifier-free formula in conjunctive normal form 
with clauses C\,...C m over variables X\,...,x n . We will show how to construct (in 
time polynomial in size of if/) a 2-player consumption game G,p of dimension m with a 
distinguished state S[ such that if/ is true iff the vector 2n + 1 is minimal safe vector in 

Si. 

is constructed as follows: For every 1 < i < n we add states s,, s Xi and s^ Xi , 
then we add new states s„ + \ and r. For every 1 < i < n the state s, belongs to Player 
O iff Qi = V. All other states belong to player □. Then, for every 1 < i < n and for 
both U e [xi, ->Xi] we add transitions s, s Xi and s^ ^* The labels on these 
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Fig. 1. Game for formula ifi = 3xi Vx23x3(xi V -1x2 V X3) A (-1X1 V* V X3) A (-1X1 V -1x2). 



transitions are defined in the following way: for every 1 < i < n, every L, e {x,, -a,} 
and every 1 < j < m we set: 



Finally, we add transitions s, I+ \ — > r and r — > r. 

Figure[T]illustrates the construction for a specific formula if/. 

Now any pair of strategies cr, 71 of players □ and O, respectively, determines a 
truth assignment v^: x\,...,x„ — > {0,1} in obvious way: we set Vo- iJr (x;) = 1 iff 
Play^^isuln + 1) visits state s Xj (it is obvious that this definition is correct). Thus, 
formula if/ is true iff Player □ has a strategy cr such that for every strategy n of O the 
assignment v^ n satisfies ip. But v cr ,^(<^) = 1 iff for every 1 < j < m we have Va-^(L) = 1 
for some literal L in Cj, which happens if and only if Play^Jsi^n + 1) visits state 
Sl- Furthermore, Play^Jsx^n + 1) visits some with L in Cj if and only if the j- 
th resource can be reloaded on Play^^si^n + 1) before state s n+ \ is reached. Putting 
these observations together we see that if/ is true if and only if Player □ can enforce the 
reload of every resource before the state s n+ \ is reached (i.e. during the first 2« steps). 
We claim that this happens if and only if 2n + 1 is safe in s\. 

Note that no matter what the players do, no resource can decrease by more than 
2n before the play either reaches s„ + i or uses a transition that permits reload of this 
resource. Thus, if □ can enforce reload of every resource in first 2n steps, then any vector 
a > 2n + 1 must be safe. On the other hand, if for every strategy cr of □ the player O 
can prevent reload of at least one resource during first 2« steps, then no matter what 
□ does at least one resource must have value exactly 1 when s„+i is reached (provided 
that in s\ all resources where initialized to 2n). Then player □ cannot use the transition 
from s„ + i to r and he is thus forced to visit configuration F, showing that 2n + 1 cannot 
be safe in si. 

We have proved that if/ is true iff 2n + 1 is safe in si (and thus we have already 
proved the PSPACE-hardness of deciding whether a given vector is safe). Now we show 
that no vector y < 2n + 1, y + 2n + 1 can be safe in si . This will prove that if/ is true iff 
2n + 1 is minimal safe vector in S\. The crucial observation is that no matter what the 




-2(n - i + 1) if Li is in Cj 



- 1 otherwise 



-1 
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two players do, every resource decreases by at least 2n before it can be reloaded for the 
first time. Thus, in order to win, Player □ must start with all resources initialized to at 
least 2n + 1. 

Now to show the NP-hardness of decision problems for 1 -player games, it suffices 
to show a reduction from SAT, i.e. from restricted version of QBF where all quantifiers 
are existential. It is easy to see that in this restricted case the reduction presented above 
produces a 1 -player game. 

Finally observe, that in both reductions, if 2n + 1 was safe in S\ then player □ 
never needed to reload any resource to value greater than 2n + 1 in order to win. Also, if 
2n + 1 e Safe(s\) then2w + 1 was minimal element of Safe(s\). Thus, in our reductions 
2n + 1 was (minimal) element of Safe(s) if and only if it was the (minimal) element of 
Cover(s). This gives us the lower bounds on membership problems for Cover(s). □ 

For better clarity we again restate the important observation about game from 
the proof above. This observation will be useful in the proof of DP-hardness below. 

Claim. (1.) Let ip be a propositional formula (i.e. existentially quantified boolean for- 
mula) in CNF with m clauses and n variables. Then the player can reload every resource 
before he reaches the second-to-last state of G^ <=> <p is satisfiable. 

Lemma 24. The problem, whether a given vector a is a minimal safe vector in a given 
state s is DP-hard for 1 -player games. 

We will present a reduction from SAT-UNSAT. 

Let (<p, iff) be any pair of propositional formulas in CNF with <p — C\ A • • • A C m is 
formula over variables X\,...,x n and iff = C m+i A • • • A C m+m > contains only variables 
yu . . . , y„> . Denote M = m + rri + 1 . We will show how to construct (in time polynomial 
in size of (tp, iff)) a 1 -player consumption game of dimension M with distinguished 
state Si and vector £ such that (si,£) is a safe configuration of G v $ if and only if ip is 
satisfiable and if/ is unsatisfiable. 

Game G^ consists of two gadgets, H v and H^. Construction of these gadgets is 
very similar to construction of games G v , Gf from previous lemma (where we treat <p 
and if/ as existentially quantified formulas). However, apart from different dimension of 
labels there are other subtle differences; therefore we give explicit description of these 
constructions. 

Let us start with H^: for every 1 < ;' < n we have states s it s x . and s^ Xj , we also 
add two new states s n+ \ and r. For every 1 < i < n there are transitions from s,- — > s Xi , 
si — > s^ Xj , s Xi — > Si+\ and s^ Xi — > s,-+i. Moreover, there is transition s n +\ — > r. For the time 
being we leave r without outgoing transition: we will append gadget to r later. 

We now define labeling of transitions in H v . For every 1 < i < n and L, e {x, , -a,} 
denote a L . the label of transition s, — > s Lj and /5 L . the label of the single transition out- 
going from si t . Then for every 1 < j < M we define 




-2{n - i + 1) if j < m and L, is in Cj 
-1 otherwise 

oj if j < m and L, is in Cj 
-1 otherwise 
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We will now continue with the construction of //</,. For every 1 < i < n' we have 
states f,-, t y . and t-y ( together with additional states t n >+\,p,p'. For every 1 < i < n' 
and every L,- € {y,-, -ry,-} we have transitions f, ti i and f, + i; finally we have 

transitions t n '+\ A p, r„, +1 ^> p' , p^> p and p' A p'. The labels are defined as follows: 
we have n = (u>, . . . ,a>, — 1, . . ., — 1, -2), n' = (lj, . . . , a>, -2, . . ., -2, -1) and for every 

m m' m m' 

1 <i <n' and every L,- e {y,-, -iy,) we have: 

-2(n' - k + 1) if m < j < M and L,- is in C,- 
7t,-(7) = ' fc> if 1 < j < m 

- 1 otherwise 



\(jl> if 1 < j < m or if m < j < M and L, is in C, 
1-1 otherwise 



The example of gadgets for specific formulas can be seen on Figure |2] Note that 
can be viewed as game from proof of Lemmal23l and thus the observation in Claim 
(1.) can be applied to this game. 

We now join gadgets and into one game by adding transition r' A ti, 
where p = (u>, —1, .. ., -1). Let £ be such that 



In + 1 if 1 < j < m 

2n + 2n' +4 if m < j < m' 
2n + In' + 5 if j = M 



We claim that (jp, ip) is a positive instance of S AT-UNS AT iff £ is minimal safe vector 
in s\. We will refer to resources with indexes between m and m + m' as intermediate 
resources. 

Suppose that £ truly is minimal safe vector in S\. Then the player must be able 
to reach one of states p, p' without exhausting any resource. In particular, in order to 
be able to use transition from s„+\ to r the player must be able to reload each of the 
first m resources before he reaches s n+ \, because each of these resources decrease by 
at least 2n before this state is reached. From Claim (1.) it follows that <p is satisfiable. 
Moreover, when player reaches t\, the value of all intermediate resources is exactly 
2n' + 2. Assume, that iff is satisfiable. Then the player can continue from t\ and reload 
all intermediate resources before f„< + i is reached, by playing in accordance with the 
satisfying assignment for if/. The player thus reaches t„> + \ with the last resource having 
value exactly 3 and all other resources having arbitrary large value. He can then use 
transition from ?„< + i to p' decreasing the last resource by 1 . It is clear, that the player can 
use this strategy to reach p' even if he starts with resources initialized to £-(0, . . . , 0, 1), 
a contradiction with minimality of This shows, that iff must be unsatisfiable. 

Now assume that (if, if/) is a positive instance of SAT-UNSAT. Since <p is satisfiable, 
the player can reload all of the first m resources before he reaches s„+i. Thus, he can 
reach t\ with first m resources having arbitrary large value, the intermediate resources 
having value exactly 2n' + 2 and the last resource having value exactly 2n' + 3. Then, 
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Fig. 2. Gadgets H 9 and for <p = x\ V -1X2 and ij/ = (y 1 V y 2 ) A (-ryj V -13*3). The transition that 
joins gadgets into game is indicated. The game starts in configuration (ii,(5, 12, 12, 13)). 

no matter which path through the player chooses, he can reach t„> +1 without ex- 
hausting any resource and with intermediate resources having value at least 2 and the 
last resource having value exactly 3 when f„/+i is reached (he just needs to reload any 
intermediate resource to 2n' +2 if he has the possibility to do so along the path). He can 
then use the transition to p and win the game. This proves that £ is safe in S\. Suppose 
that there is some y < y + £ that is safe in s\. Clearly, all components of y and 
except for the last one, have to be equal: each of the first m resources is decreased by 
at least 2n and every intermediate resource is decreased by at least 2n + 2n' + 2 before 
t n >+\ is reached. Thus, we must have y(M) <2n + 2n' + 4. But if the player starts with 
M-th resource initialized to y(M), the value of this resource is at most 2 when t n , + \ is 
reached. Now the only thing the player can possibly do in order to win is to use the 
transition from t^+\ to p' . But to do this, he must reload every intermediate resource 
before he reaches t„>+\. But this is not possible, since i// is unsatisfiable. Thus, y cannot 
be safe in S\. 

D Proofs for decreasing and one-player consumption games 
Proof of Lemma [12] 

We first observe that a winning strategy in the generalized Biichi game ensures that 
every Rj is visited infinitely often, and hence also satisfies the Streett condition. Thus 
a winning strategy in the generalized Biichi game is a winning strategy in the Streett 
game. We now argue for the other direction. Consider a winning strategy <x for the 
Streett game. We argue that the strategy also ensures the generalized Biichi condition. 
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Consider an arbitrary strategy n for player O, and let w = Play^^s). Consider U = 
inf(w). Then U is a sec, and hence must contain cycles. Since the game is decreasing, 
it follows that for every dimension z, there is at least one transition in U with negative 
weight or u> in dimension i. Hence for all 1 < i < d either U n Ri + or else we have 
U nG, + 0. Since cr is winning for the Streett condition, for all 1 < i < d if U n G,- ^ 0, 
then U dRi + 0. Hence for all 1 < i < d we have U n + 0. Hence all the Buchi 
objectives are satisfied. Hence cr is a winning strategy for the generalized Buchi game. 
The desired result follows. 

Proof of Lemma [14] 

Intuitively, it follows from the fact that if a state of S is visited twice (i.e. the play 
follows a cycle in the state space) without reloading a "new" resource in the cycle (i.e. 
a resource which has not been already reloaded before the cycle) then player □ may 
improve her strategy by simply omitting the cycle. More precisely, assume that there is 
a play which visits the same state in the z'-th and the j-th steps (z < j) and every resource 
reloaded between the z'-th and the ./'-the steps has already been reloaded before the z'-th 
step. Then note that the values of resources that are not reloaded between the z'-th and 
the j-th steps can only become smaller there, and as every resource reloaded between 
the /-th and the j-th steps has already been reloaded before the z'-th step, it suffices to 
make the reloads before the /-th step a bit larger to compensate for removing the reloads 
between the /-th and the j-th step. 

Thus we may modify the strategy cr for player □ so that whenever a play follows 
the first z steps of the above fixed play, the new strategy, cr', starts behaving as if it 
has already followed j steps of the play (thus removing the part between the /-th and 
the j-th steps). Moreover, to be safe, the new strategy has to reload a bit more before 
reaching the /-the step (by e.g. the total value reloaded between the z'-th and the j-th 
step of the fixed play). The resulting strategy will be safe and will reload all resources 
sooner than the original strategy (at least in some plays). We need the decreasing games 
in order to be sure that the process of removing cycles will eventually stop (observe 
that in decreasing games every safe strategy reloads all resources in a bounded number 
of steps no matter what player O is doing) and produces a strategy which reloads all 
resources in at most d ■ \S\ steps. 

Now we present the formal proof. 

Let us fix a safe strategy cr. Denote by Fg- t „ the number of steps the Play ^^{s, a) 
needs to reload all resources. Denote by F a the maximum max^ F <TIT . It follows from 
the fact that C is decreasing that F a is finite for every safe cr (otherwise a play won by 
n could be easily constructed due to the fact that once cr is fixed, the game is finitely 
branching). 

If Fa- < d ■ \S |, we are done. So assume that F CT > d ■ \S\ and that cr minimizes iv 
among all safe strategies. Given n, denote by the shortest prefix of Play^^s, a) in 
which all resources have been reloaded. Denote by H a the number of distinct paths Wa-. n 
of length Fa- and assume that cr minimizes among all safe strategies. We show that 
there is cr' such that either F^ < Fg-, or F a i = F a and < Hg-, a contradiction with 
minimality of Fa- and Hg-- 
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Observe that if we fix the initial configuration, (s, a), then strategies can be seen 
as functions from sequences of states to transitions of C together with values to which 
resources are reloaded. This information suffices to uniquely determine behavior of 
player □ in plays initiated in (s, a). 

Consider one of the paths w lT _„ = (so, uo) ••• «*) of length k = /v. As/v > d-\S\, 
there must be i < j such that s,- = Sj and all resources reloaded between the 2-th step 
and the j'-th step of have already been reloaded before the 2-th step of w (TIT . Denote 
by B the largest value to which any of the resources is reloaded between the 2-th and 
j-th step of wv iJr . We define choices of a new strategy, cr', for all possible histories (i.e. 
sequences of states visited before the choice) w e S + : 

• If w = Sq ■ ■ ■ Sp, where k' < i, then cr' chooses the same transition as cr for the history 
w, but whenever cr reloads a resource to m, cr' reloads the same resource to m + B. 

• If w = so ■ ■ ■ S(t\ ■ ■ ■ t/c', then cr' chooses the same transition as cr for the history 
sq ■ ■ ■ SiSi+i ■ ■ ■ Sjt\ ■■■tj ( f. Moreover, whenever cr reloads a resource to a value m, cr' 
reloads the same resource to the value m. 

• For all other histories the strategy cr' chooses the same transition as cr and reloads 
resources to the same values as cr. 

Note that cr' is still safe. Indeed, if 

Play^ ^s, a) = (s , a ) ■ ■ ■ (s h a,)(fi, a i+l )(t 2 , a i+2 ) ■ ■ ■ 

then there is a strategy n' such that 

Playo-yis, a) = (s , a' )--- (s h a'^Si+uBi) ■ ■ ■ (sj,B H )(t u a' M )(t 2 , a' i+2 ) ■ ■ ■ 

It is easy to observe that a'.( j) < a,(j) for all i, j. 

Note also that F lT > < F, T . Moreover, all plays according to cr' that start with the se- 
quence of states so ■ ■ ■ s, reload all resources earlier than corresponding plays according 
to cr. Thus either /v < F^, or F^ = F^ and < H a , a contradiction with minimality 
of Ffj and H a . 

□ 

Proof of Lemma H5l 

The idea is basically the same as in the case of decreasing games. Assume that cr does 
not satisfy the conclusion of the lemma. Then the play PlayJ^s, a) must follow a cycle 
in the state space which does not reload any "new" resource that has not been reloaded 
before the cycle. This cycle can be simply omitted and its reloads may be added to 
the reloads made before the cycle. Eventually, after removing finitely many cycles, all 
resources that are decreased infinitely many times in Play^is, a) will be reloaded in the 
first d ■ \S | steps, and resources that are decreased finitely many times in Play a {s, a) will 
be decreased only in the first d ■ \S\ steps. 

Formally, let us fix a safe strategy cr. Denote by F a the least number of steps the 
play Play^is, a) needs to reach a point where every resource is either reloaded, or is 
never decreased in the future. If F a < h ■ \S |, we are done. So assume that /<V > d ■ \S\ 
and that cr minimizes F a among all safe strategies. 
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Consider a prefix w a = (so,ao) ■ • -(Sk, aik) of Play^is,^ of length k = Fa- As 
F, T > d ■ \S |, there must be i < j such that s, = Sj and all resources reloaded between the 
z'-th step and the j-th step of w a have already been reloaded before the z'-th step of w^. 
Denote by B the largest value to which any of the resources is reloaded between the z'-th 
and j-th steps of w a . We define choices of a new strategy, cr', for all possible histories 
w e S + : 

• If w = so ■ ■ ■ 5jf , where k' < i, then cr' chooses the same transition as cr for the history 
w, but whenever cr reloads a resource to m, cr' reloads the same resource to m + B. 

• If w = so ■ ■ ■ S(t\ ■ ■ ■ tk', then cr' chooses the same transition as cr for the history 
Sq - ■ ■ SfSi+i ■ ■ ■ Sjt\ • • ■ tie'. Moreover, whenever cr reloads a resource to a value m, cr' 
reloads the same resource to the value m. 

• For all other histories the strategy cr' chooses the same transition as cr and reloads 
resources to the same values as cr. 

It is easy to see that cr' is safe and that < F tT , a contradiction with minimality of 

Fa. 

□ 

Proof of Theorem [l6l 

First, assume that all states s satisfying Safe(s) = have been removed from the game. 
This preprocesing can be done in polynomial time for 1 -player and decreasing games 
(Theorem[T3ll. 

First, consider decreasing games. According to Lemma[14] a e Safe(s) iff there is 
a strategy cr of player □ which reloads all resources in at most d ■ \S\ steps no matter 
what player O is doing. However, existence of such a strategy can be decided using 
polynomial time bounded alternating Turing machine, which implies that the problem 
belongs to PSPACE. 

Second, consider 1 -player consumption games. According to Lemma [15] a e 
Safe(s) iff there is a strategy cr of player □ such that Play^is, a) reaches in at most 
d-\S\ steps a configuration (t,B) satisfying the following condition: there is a safe strat- 
egy cr' in (t,B) such that Play^it,^) does not decrement any recource which has not 
been reloaded in Play^s, a) before reaching (t,B). Note that existence of such cr and cr' 
can be decided in non-deterministic polynomial time as follows: first, guess a path w of 
length at most d ■ \S \ initiated in (s, a) to a configuration {t,B) (reloads do not matter too 
much here, as it suffices to always reload to 2 • d ■ \S \ ■ €). According to what resources 
have been reloaded in w, decide whether (t,B) is a safe configuration in a consumption 
game C obtained from C by pruning all transitions that decrease resources not reloaded 
in w. 

Proof of Theorem^] 

The procedure Min-dist is presented on Figure [3] 

In the following we denote by 1 the vector (!,...,!). 
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Procedure Min-dist(£),i,r) 



input : A one-player consumption game D = (S D , E D , ((S n ) D , (5 ) o ), L) with labels over 

Z< ; states s,r e S D , where r is a zero-sink 
output: The set A (s, r) 

1 foreach q e S D do d[q] <— ((oo, . . . , oo)), n[q] <— d[q] 

2 d[r] <- {(l,...,l)},n[r] <-d[r] 

3 repeat 



foreach </ e S c do <— <— 

foreach ^ e 5 D do 

let g -4 qi , . . . , q -4 ^ m be all transitions in T) outgoing from q 
if q e (5 n ) then 
for ( = 1 to m do 

| n[q] <- «[?] U (d[q,] - 5 ; ) 
end 

n[q] <— min-set (n[q]) 

else 

temp <— d[q{[ 
for i = 2 to ra do 

[ temp f-cwmCtemp.dfe] - 5,0 
end 

n[q] <— min-set (temp) 

end 

end 

20 until = d[q]for every q e S D 

21 return d[s] 



Fig. 3. Algorithm for minimal safe distances. 

Lemma 25. Let a be the length of the longest acyclic path in D. Then the procedure 
Min-dist(D, s, r) terminates after at most a iterations of the repeat-until loop. More- 
over, upon termination we have d[s] = Ao(s, r). The running time of the algorithm 
is 0(\S D | • b ■ a ■ \M\ 2k ^j where b is the branching degree of D and \M\ is the maximal 
possible size of sets n[q], d[q] and temp. 

Proof. Recall that in the following we have a consumption game C = (S ,E,(S n ,S <>),L) 
with transitions labeled only by vectors over Z<o, and zero-sink r of C. For every j < 
we say that a is a j-safe distance from some state s to r if there is a safe strategy cr 
in (s, a) such that for any strategy n of Player <> the infinite path Play ^^s, a) visits 
configuration of the form (r,/3) after at most j- transitions. We denote Safe J c (s, r) the set 
of all j-safe distances from s to r in D. Finally, we denote A J c (s, r) the set of all minimal 
elements of Safe J c (s, r). 

Claim. The following holds: 

1 . For every state s of C we have 



[{1} ifs = r 

I {(oo, . . . , oo)} otherwise 
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2. For every j > 1 and every state s of C such that s si , . . . , s — ^ s m are all transi- 
tions outgoing from s we have 

■ /min-set(US 1 (4" 1 (^.'-)-«) if*eS n 

/tp(5, — % : j J J 

lmin-set(cwm(/l^ — Si, . . . ,A C (s m ,r) — S m )) if s € So 

(1.) is trivial. In (2.), if s e S n then a e Safe J c (s,r) if and only if there 
is a transition 5— Si such that a + 6, e Safe* c l (si,r). Thus, minimal ele- 
ments of Safei,(s,r) are exactly elements of min-set((J™ i(Safe J c 7 l (si, r) - Si)) = 
min-set(U;" ! min-set^a/e^ 1 ^;, r) - Si)) = [JT=Mc ^ r ) " S M- 

Now assume that s e S<>. Note that a e SafeL(s,r) if and only if there are vec- 
tors f3\ e SafeC (si,r) — Si, . . . ,/3 m e Safe' Q l (s m , r) - S m such that a > fit for every 
1 < i < m. Obviously, this happens only if there are vectors e A J C 1 (s\,r) such 
that a > Pi for every 1 < i < m. Thus, a e Safe J c (s, r) if and only if there is some 
yS e min-set(cwm(/l^ r) - tfi, . . . ,A J C (s m , r) - S m )) such that a > /3. This immedi- 
ately implies min-set(cwm(/i^r 1 (si, r) - Si,..., A^ (s m , r) - S m )) C A J c (s, r). 

Conversely, if a e A' c (s, r), then we already know that there is e cwm(Ai~ (*i,r) - 
tfi, . . . ,A£ (i m , r) - (5,„) such that a > f3. From the previous paragraph we know that 
P € r). Since members of A J Js, r) are pairwise incomparable, we get a — ft and 
thus a e cwm(A J c (si, r) - Si, . . . , A' Q l (s m , r) - 5 m ). 

Let us now consider a computation resulting from call of Min-dist(C, s, r). For any 
q € S denote M^(q) the value of set n[q] right before the j-th iteration of the repeat-until 
loop. Then it is easy to see that M'(q) = A J c (q, r). (From associativity of operations U 
and cwm it follows that computations on lines IMTTI and IT3TIT71 correctly compute sets 

min-set(|J™ iCM- 7 '" 1 ^;) - &))) and min-set(cwm(M- M (gi) - Si M j ^ 1 (q m ) - S m )), 

respectively.) 

We will now show that for any state q we have Ac(q, r) = A%(q, r), where a is the 
length of the longest acyclic path in C. This means that the fixed point is reached after 
at most a iterations of repeat-until loop and that upon termination we have computed 
the correct set. This will finish proof of Lemma [25] 

It suffices to prove that if a is a safe distance from s to r then player □ has a strategy 
cr that is safe in (s, a) such that for any strategy n of player O the path Play^^s, a) visits 
configuration (r,{3) in at most a steps. The proof closely follows the proof of Lemma[T4l 
from section |4~T1 

Let us fix a safe strategy cr such that for any strategy n of player O the path 
Play^^s, a) visits configuration (r,/3). We will say that such strategy ensures safe visit 
ofr. Denote by F^ the number of steps the Play^^s, a) needs to reach configuration 
of the form (r,/3). Denote by the maximum max^/v^. It it is easy to see that F^ 
is finite for every cr that ensures safe visit of r (otherwise a play won by n could be 
easily constructed due to the fact that without possibility to reload the game is finitely 
branching). 

If F a < a, we are done. So assume that F a > a and that cr minimizes F a among 
all safe strategies. Given n, denote by Wa-. n the shortest prefix of Play^^s, a) on which 
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configuration of the form (t,0) appears. Denote by H a the number of distinct paths 
of length F a and assume that cr minimizes H tT among all safe strategies. We show that 
there is cr' such that either /v < F^, or = F& and H ir < < H a , a contradiction with 
minimality of Fa- and H a . 

Now observe, that if we fix the initial configuration, (s, a), then strategies can be 
seen as functions from sequences of states to transitions of C. Since labels in C are 
vectors over Z<o, from this information the players can fully determine the current con- 
figuration of the game. 

Consider one of the paths Wa-. n - (so, ao) ■ ■ ■ (sk, c?k) of length k = Fa-. As Fa- > a, 
there must be i < j such that s, = sj.We define choices of a new strategy, cr', for all 
possible histories (i.e. sequences of states visited before the choice) w e S + : 
• If w = so ■ ■ ■ sic', where k' < i, then cr' chooses the same transition as cr for the history 



• If w = so ' ' ' s ih ■ ■ ■ tk', then cr' chooses the same transition as cr for the history 

s o ' ' " ' ' ' s jh ' ' ' h'- 

• For all other histories the strategy cr' chooses the same transition as cr. 

The same argument as in proof of LemmafT4lreveals that cr' still assures safe visit of r 
from s, because for every strategy n of player O with 



there is a strategy n' such that 

Play<T,A s < a ) = ( s a> a o) ■ ■ ■ 0;. • ■ • (Sj,Pj-i)(h>a:' M )(t2, a' i+2 ) ■ ■ ■ 

with a'^ j) < ai(j) for all i, j, and s% + r for every i < k < j. 

We again have F^ < Fa-- Moreover, all plays according to cr' that start with the se- 
quence of states so ■ ■ ■ s, visit configuration of the form (r,/3) earlier than corresponding 
plays according to cr. Thus either F^ < Fa-, or F^ = Fa- and H a > < H a , a contradiction 
with minimality of F, T and H a . □ 

Complexity of Min-dist It is easy to see that the following holds at any time before the 
z'-th iteration of the repeat-until loop (even during previous iterations): for every q e S 
and every a e n[q] U d[q\0 temp we have either a = (oo, ... ,06) or a < {i •€,... ,i ■ €). 
Because the algorithm stops after at most a iterations, then size of sets n[q], d[q] and 
temp is 0(a d ■(£ + l) d ). 

The worst-case performance of the algorithm is primarily affected by computa- 
tions on lines [9] and Q3] The worst-case complexities of operations are 0(\M\) for 
M - a, 0(\M\ 2 ) for min-set(M) and 0(|Mi| • \M 2 \) for M\ U M 2 and cwm(Mi,M 2 ). 
From upper bound on size of sets involved we get that single execution of linel9lor[T5l 
takes time £)(({ + l) 2rf • a 2 ^. Thus, single iteration of the repeat-until loop takes time 
0(\S D \-b-(i+ l) 2d -a 2d ). It follows that the worst-case performance of the algorithm 



isO(\S D \-b-(e+ l) 2d -a 2d+1 ). 
Proof of Lemma IT8l 

We start by formal construction of game C(n). First we introduce some notation. 



W. 



Play a , n (s, a) = (so, a Q ) • ■ ■ («;, a{)(h, a,+i)(f 2 , a i+2 ) ■ • • 
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For infinite path p = vo ^* v\ ^ ■ ■ ■ in C and 1 < j < d we denote first(p, j) the 
least index m such that 8 m (j) = oj (or oo if 5 m (j) + to for every m). 

For any / c { 1 , . . . , d) and any vector a we define a 1 to be the vector f3 that is 
obtained from a by zeroing the ;'-th component for every i e I, i.e. a'(i) = if i e / and 
a 7 © = o-CO if i i I. 

For every / c {1, . . . , d\ we denote £7 the set of all transitions v A y' e E such that 
or(0 = for every / e {1, . . . , d}\I. We set 5/ = {* e 5 | 3f e 5 : (*, f) e £7v(f,s) e £>}. 
If (5 /, £7) is a transition system then we can define a zero-subgame induced by I to be 
the one-player game Zero{C, I) = (S /, Ej, (S D C\ S j, 0), Lg,), where Le, is restriction of 
L to E[. Note that Zero(C,<D) = C. We say that state s of C is I-safe if (S/.E/) is a 
transition system and 5a/e Zero(C/) (s) ^ 0. 

Formally, we have C(7r) = (S(;r),£ , (;r),(>S'(7r),0),L(7r)), where S(7r) = {(*, m) \ s e 
S,0 <m < d}U {r,h}, where r,h £ S and where 

£(7r) = {(0, m), (f, m)) | (s, f) € E, L(s, t) e (Z<o) rf , < m < d] 

U {((s, m), (f, m + k)) | (s, f) e E, L(s, f)(^i) = w for every m < i < m + k] 

U {(0, d-\),r)\3teS: (s, f) e E, L(s, O(^) = <W 

U {((s, m), r) | i 1 is {n\,Tii, ■ ■ ■ , n m }-safe] 

U {(0, m),h)\seS,0<m<d}U {(r, r), (ft, ft)) 



The labeling L(tt) is defined for every (v, v') e as follows: 

• if v = (s, m), V — (t,m + k) for some <m < d and < k < d - m then L(n)(v, v') = 

• if V = r then L(n)(v, v') = 0; 

• ifv'=h then L(7r)(v, v') = —1. 

Note that all labels in C{n) are vectors over Z d <0 . 
We now prove the second part of the lemma. 

It clearly suffices to prove that a e Safe{s, ii) iff a e Safe c(K y((s, 0), r). 

Suppose that a e Safe(s, n). Fix a safe strategy cr for (s, a) such that w = Play^s, a) 
matches n. Suppose that w = (s, a) ^> (si, an) ^ (S2, «2) ■ ■ ■ and let / = {i e {1, . . . ,d] \ 
Sk(i) = a) for some k > 1 }. 

Suppose that / = {1, . . . , d). Then the following sequence w' is a path in C(n): 

a* 1 

, Si "firstiw,*!) 

W = (S,0) -» • • • (^ rS f(p,^)-l, 0) -» (5^ rst(w ^,). 1) -> ■ • ■ 

c 71 ! Tm 

-> (Sfirst(w,n m )-l,m - 1) -> (i/iref^,^,), w) -> ■ • ■ 

Moreover for every 1 < ;' < J the decrease of the ;'-th resource before the first reload 
along the path w' is the same as decrease of ;-th resource before first reload along path 
w. It follows that W = Play^is, a) for a suitable safe strategy cr'. Since w' also visits 
configuration of the form (r,yS), a is a safe distance from (s, 0) to r in C(n). 
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Procedure Min-safe(C,s) 
input : A rf-dimensional one-player consumption game C such that Safe(t) + for every 

state of C, state s of C 
output: A set of all minimal elements of Safe(s) 

1 M <— 0, dist-set <— 

2 foreach n e 17(d) do 

3 construct a game C(n) with a distinguished state r 

4 dist-set <-Min-dist (C(4(s, 0),r) 

5 M <— min-set (dist-set U M) 

6 end 

7 return M 



Fig. 4. Computing minimal elements of Safe(s) 

If I + {1, . . . , d], then there must be number K such that for every j > K we have 
(5/(0 = for every i e {1, . . . ,d] \I. Moreover, K -firstiw, n\]\). But this means that the 
state sk is /-safe. Thus, the following sequence W must be a path in C(n): 

if 1 

W = (S, 0) -> • • • {SfirsKp^yi, 0) -> (^ri^Tri), 1) ~> ' ' ' 

^1 Tffl 

(Sfirst(p,n m )-\,m- 1) ^ (Sfirst{p,n m ),m) -> • ■ ■ 


-> (jjiraKp^-i. |/| -l)-»r-^r-^r--- 

In precisely same manner as above we can now conclude that a is safe distance 
from (s, 0) to r. 

On the other hand, assume that a is a safe distance from (s, 0) to r in C{n). Let 
cr be a safe strategy in (s, a) such that w = Play(s, a) reaches configuration of the 
form (r,/3). Assume that w = (s, 0) ^ (s\, i\) ■ ■ ■ (st, ik) r. Then there is a finite path 
w' = s -V ii ■ • • Sk-i in C. If we denote / the set of resources that were reloaded 
somewhere along the path w', then for every i e / the decrease of z'-th resource before 
the first reload along the path w' is the same as decrease of z'-th resource before first 
reload along the path w. For i £ I we have that decrease of the z'-th resource along the 
path w is same as decrease of this resource along the path w. Moreover, we know that 
state is /-safe. Thus the player has the following safe strategy cr' in (s, a): play along 
the path w' until its end is reached. If it is possible to reload any resource along this path, 
always reload it to value \S \ ■ d ■ M, where M — I ■ \S \ ■ d. When the play reaches state Sk, 
switch to a safe strategy for (sk, (M, . . ., M)) in game Zero(C, I). Such a strategy must 
exist due to the Corollary|7] 

Strategy cr' is safe because when i^is reached, resources in / have value at least M 
and resources not included in / are never decreased after the £-th step. Furthermore, all 
first reloads of every resource happen during first k steps. Thus the order of first reloads 
is same as in w. This proves that a € Safe(s, tt). 

Proof of TheoremQI] 

The procedure Min-safe is presented on FigureH] 
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Correctness of the algorithm follows from Lemma[T8] Indeed, if a e Safe(s), then 
(s, a) matches some permutation n and thus a e Sqfe(s,n). Moreover, if a is minimal 
element of Safe(s) then it is also minimal element of Safe(s,n) and by Lemma [T8l we 
have a e Ac(n)(s, n). This means that a is considered for addition to M at some point 
during the computation. If a is not added to M or is removed from M sometimes later, 
it means that there is a vector < a, fi + a such that /3 e A c „>((s, 0),r) for some 
permutation n' . From Lemma[18]we get that j3 e Safe(s, n'); in particular /3 e Safe(s), a 
contradiction with minimality of a. Thus, every minimal element of Safe(s) is a member 
of the returned set. 

On the other hand, assume that the returned set M contains a vector /? that is not a 
minimal element of Safe(s). We already know that the returned set contains all minimal 
elements of Safe(s) and since elements of this set are pairwise incomparable vectors, it 
follows that j6 i Safe(s). In particular (3 t Safe(s, n) for every n e 17(d). From Lemma 
[18] we have /3 t Ac(n)((s, 0), r). From correctness of Min-dist it follows that /3 is never 
considered for addition to M, a contradiction with f3 being an element of the returned set. 

We will now discuss the complexity of procedure Min-safe (including the initial 
preprocessing). The complexity of preprocessing is 0(\S \ ■ (\S \d + \E\) ■ min{|5 1, d}). For 
a fixed permutation n the state space of C(n) can be constructed in 0(\S \-d) time. In order 
to compute transitions in C(n) we need to determine whether Safe Zero(c ^ Kj })(s) £ 0, 
for every s e S and every 1 < m < d. This amounts to solving l^lii instances of 
nonemptiness problem for Streett automata, for automata with at most \S \ states and \E\ 
transitions, so the complexity of this step is 0((\S \ 2 d 2 + \E\\S \d) ■ mindS |, d\). 

The branching degree of C(n) is at most \S \ + 2 and no acyclic path in C(n) is 
longer then \S \ ■ d + 1. Also, tc{n) ^ (c- Thus, single call of Min-dist(C(7r), (s, 0), r) has 
complexity 0(\S\ 2d+1 ■ (E + \) 2d ■ d 2d+2 ). Moreover, from Theorem [T71 we get that size 
of set M never exceeds \S \ d ■ d d ■ (I + l) d , so single execution of line 5 has complexity at 
most 0(\S ^ d ■ d 2d ■ ({ + l) 2 "'). We can see that the overall worst-case running time of the 
algorithm is 0(dl ■ \S \ 2d+3 ■ {£ + \) 2d ■ d 2d+2 ). 

Proof of Theorem l20l 

We start this part by formal definition of game C. Recall that for any / c {1, ...,«/} 
and any vector a we denote a 1 the vector [5 that is obtained from a by zeroing the z'-th 
component for every i e I, i.e. a ! (i) — if i e I and a'(i) = a(i) if i i I. 
We have C = (S , E, (S □, S o), L), where 

S = {(s, I) \s 6 S, I c {1, . . . ,d}} U M, where r<£S 

E = {((s, I), (t, I)) | (s, t) e E, L(s, t) e (Z< ) d ) 

U {((s, I), (f, J)) | 0, f) e E, L(s, 00") = co for every j eJ\ 1} 

U {((s, I), r)\3teS: (s, t) e E, L(s, t)(j) = to for every ; £ {1, . . . ,d) \ 1} 

U {(is, I),h) | s€S,/c(l d}}U{(r,r)} 

and where the labeling L is defined for every (v, v') e E as follows: 
• if v = (s, /), v' = (f, /) for some I c / c {1, . . . ,d] then L(v, v') = a 1 ; 
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• if V = r then L(v, v') = 0. 

To prove the first part of the theorem, it suffices to prove that Safe c (s) = 
Safe^((s, 0), r). The equality of sets of minimal elements then follows. 

Intuitively, game C was obtained from C by performing two major modifications. 
The first modification augmented states of C with information on which resources were 
reloaded at least in the past once during. This additional information is of no help to any 
player, since the information on which resource were reloaded can be already obtained 
from the finite sequence of states visited before the player has to make the choice - and 
by definition of strategy both players always have a full knowledge of this sequence. 
The second modification - change of labeling by zeroing some components - prevents 
every resource from decreasing once it was reloaded for the first time. We claim that 
changes in resource value after this resource was reloaded for the first time do not 
influence the initial value of this resource the player □ needs to "survive." This follows 
from the fact that we assume Safe(s) + for every state s of C - for a to be the safe 
vector in s the player □ only has to enforce that z'-th resource is decreased by less than 
a(i) before it is reloaded for the first time. Once the resource can be reloaded, player □ 
can simply reload arbitrary amount of resource ;' and switch to some safe configuration. 

The discussion above intuitively reveals that Safe c (s) = Safe^is, 0). But since C is 
decreasing, then the safe strategy in ((s, 0), a) has no choice but to visit a state where the 
additional information tells us that all resources were reloaded at least once, i.e. state 
r. Since in minimum multi-distance reachability games we always have Safe(s, r) c 
Safe(s), it follows that Safe^s, 0) = Safe^((s, 0), r) and thus Safe^((s, 0), r) = Safe c (s). 

Formally^ let cr be some safe strategy in (s, a) in game C. We define a strategy 
cr for □ in C. Note that since we Jiave no reloads in C, for the fixed initial config- 
uration ((s, 0), a) all strategies in C can be viewed as functions from finite sequence 
of states to transitions in C (current configuration of the game can be always in- 
ferred from knowledge of this sequence and initial configuration). Therefore we as- 
sume that initial configuration ((so, 0), «o) with so = s and ao — a was fixed. For 
every finite sequence of states w = (so,@)(si,I\)(s2,h) •■ •(■?»!> W m C, such that 
((Si,Ii)(Si+i,Ii+i)) e E for every 1 < i < m, there is exactly one sequence of config- 
urations w' = (so, ao)(si, ai)(s2, ui) ■ • ■ (s m , a m ) such that w' is a prefix of Play an {s, a) 
for some strategy n of player O. Now there is exactly one transition of the form 
((s m , Im), iaiyv'), J)) or ((s m , I m ), r) in C and we require cr(w) to select exactly this tran- 
sition. 

We claim that <x is safe in ((so, 0), a) and that for any strategy n of O the path 
Play- n ((s(),%),a) reaches (r,/?) for some f3 (and thus a e Safe^((s,@), r)). Fix a strat- 
egy 7r of player O. Then there is strategy n' of O in C such that the decrease of ;-th 
resource before its first reload on Play tTn ,(s, a) is the same as decrease of i-th resource 
on Play~ n ({s, 0), a), for every 1 < i < d. This shows that a is safe in ((s, 0), a). Sup- 
pose that Play- n {(s(), 0), a) does not reach configuration of the form (r,/3). Then there 
is a strategy n" of player O in C such that at least one resource (say the z'-th one) is 
never reloaded on Play tTJI „(s, a). Because C is decreasing, then the ;'-th resource must 
be decreased to zero along Play^^is, a), a contradiction with cr being safe in (s, a). 

Now suppose that a e Safe^(s, 0), r). Fix a safe strategy cr in ((s, 0), a) such that 
Play^dsQ, 0), a) reaches (r,y6) (for some vector (3) for any strategy n of O in at most \S \ ■ 
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d steps. We define a safe strategy cr' in (s, a) in C. For a finite sequence of configurations 
w = (so, ao)(si,a{) ■ ■ ■ (s m , a m ) of C such that not all resources were reloaded on the 
path (so, ao) —> (s\, a\) — * ■■■ -^(s m ,a m ) we denote W the unique sequence of states 
in C such that w' = ((so, 0), aro)(Csi,/i), a\) ■ • ■ ((s„„ I m ), a' m ) for suitable Ij, a'., where 
1 < j < m. Now if cr(w') + r we require cr'(w) to select the unique transition s m A 
where q is such that cr(w') = ((q, J),f3) for some J and ji. If this transition permits reload 
of any resource, the strategy reloads it to \S \ ■ d ■ M where M = t ■ \S \ ■ d. If sigma(w') = r 
then cr'(w) selects transition s m that reloads all resources that were not reloaded so 
far (from definition of C it follows that there must be at least one such transition at this 
point), reloads all of these resources to \S | ■ d ■ M and starts to behave like a some safe 
strategy in (q, (M, . . ., M)) (such a strategy must exist due to the Corollary |7). 

We claim that strategy cr' is safe in (s, a). Since cr ensured that the play always 
reaches some configuration (r,f3), then cr' will always start to behave as some safe strat- 
egy in (q, (M, . . ., M)), for some state q, no matter what the player n does. When this 
happens, all resources have value at least M, because the switch cannot happen after 
more then \S \ ■ d steps. If there is some strategy n of player O such that Play^, n (s, a) 
visits configuration F before all resource could be reloaded, then there is also strategy 
n' of O in C such that Play^^ds,^)^) visits F, a contradiction with cr being a safe 
strategy. It follows that cr' is safe in (s, a). 

Complexity Let us now discuss the complexity of the algorithm. The preprocessing 
takes time 0(\S \ 2 ■ \E\ ■ d) and construction of C takes time 0(2 d ■ (\S \ + \E\)). The game 
C has 0(2 d ■ \S |) states and branching degree at most \S \ + 2. We also have < Iq 
(i.e., the maximal absolute value of weight in C is bounded by the maximal absolute 
weight in C). Finally, the length of every acyclic path in C is bounded by \S \ ■ d + 1, 
because from every state (t, I) we can reach only states r, h and states of the form (f, J) 
where J — I or |7| > |/|. Thus, by Theorem [17] the call Min-dist(C, (s, 0), r) takes time 
0(2 d ■ d 2d+1 ■ |5| 2fl,+3 • (I + l) 2d ). This is also the total worst case complexity of our 
algorithm. 
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